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The present invention relates to non-pathogenic strains of HIV-1 and to components, 
parts, fragments and derivatives thereof and to genetic sequences derived therefrom and 
their use in the development of diagnostic and therapeutic compositions for the treatment 
and prophylaxis of AIDS and AIDS-related disorders. The present invention also relates 
to a method for attenuating pathogenic strains of HIV-1 by mutagenizing particular 
regions of the HIV-1 genome. 

Bibliographic details of the publications referred to in this specification are collected at 
the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for the 
nucleotide and amino acid sequences referred to in the specification are defined 
following the bibliography. 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated element or integer or group of elements or integers but 
not the exclusion of any other element or integer or group of elements or integers. 

Genomic nucleotide sequences of HIV-1 strains referred to herein are represented by 
their corresponding DNA sequence. 

Exemplary viral isolates referred to herein as "CI 8" and "C98" were deposited at the 
PHLS Centre for Applied Microbiology and Research, European Collection of Animal 
Cell Cultures (ECACC), Division of Biologies, Porton Down, Salisbury, Wiltshire SP4 
OJG. CI 8 was deposited on 17 October, 1994 under Provisional Accession Number 
V94101706 and C98 was deposited on 31 October, 1994 under Provisional Accession 
Number V941031169. Viral isolate "C54" was deposited at ECACC on 14 February, 
1995 under Provisional Accession Number . 
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A summary of particular deletion mutants of HIV-1 of the present invention referred to 
herein is given in Figure 1 1 . 

Acquired Immune Deficiency Syndrome (AIDS) and AIDS related disorders are the 
5 clinical result of infection by Human Immunodeficiency Virus type I (HIV-1) (Barre- 
Sinoussi et al, 1983). Infection by HIV-1 is generally characterised by progressive 
immune system damage (Teeuwsen et al, 1990; Clerici et ah 1989) leading to 
opportunistic infections, malignancies or wasting syndrome that constitute clinically- 
defined AIDS (Busch et ah 1991; Klaslow et al, 1990). 

10 

The high mortality rate of individuals infected with HIV-1 together with the social and 
economic consequences of the continuing HIV-1 epidemic has created an urgent need 
for a safe and effective treatment and/or prophylaxis against the devastating effects of 
AIDS. However, despite over a decade of high level scientific research into the 

15 pathogenesis of HIV-1 and the clinical manifestations of the disease, together with a 
detailed molecular analysis of the virus, there has been little success in the development 
of an effective vaccine. To date, the most effective therapy is treatment with zidovudine 
(AZT) which delays the onset of full blown AIDS and alleviates to some extent the 
symptoms of HIV-1 infection. However, AZT is not an innocuous compound and AZT, 

20 metabolic products thereof or impurities therein can cause a number of side effects 
which limit long term treatment with the drug. Furthermore, AZT resistant isolates have 
been reported during treatment. Clearly, therefore, a need exists to develop alternative 
strategies in preventing and treating HIV-1 infection. 

25 The initial phases of HIV-1 infection are summarised by Levy (1993) as involving 
attachment, fusion and nucleocapsid entry. These phases have been the traditional foci 
in research into development of antiviral strategies. The molecular events at the virus 
genomic level have also been the subject of intense scientific research with an aim being 
the development of a live attenuated vaccine as a possible approach for the treatment or 

30 prophylaxis of HIV-1 infection. 
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There is a high variable rate of progression from initial HIV-1 infection to AIDS which 
reflects a rapidly changing pathogen and variable immune response of the host to 
infection (Sheppard et al, 1993). With regards to the latter, HIV-1 can be considered 

5 as a heterogenous group of viruses differing at the genetic level with concomitant 
variable pathogenicity. For example, HIV-1 strains can differ in their capacity to kill 
cells. Furthermore, it appears that HIV-1 strains evolve in a host after infection and that 
the evolution varies depending on the tissues infected by the virus. The major sites in 
the genome apparently responsible for biological and pathological variation are the 

10 highly variable envelope region (Cheng-Mayer et al, 1991; Shioda et al, 1992; Hwang, 
et al 1991; Sullivan et al, 1993; Groenink et al, 1993) and the viral regulatory regions 
such as tat (Leguern et al, 1993). The genetic complexity of the HIV-1 group of viruses 
together with their variable pathogenicity, are major difficulties in the development of 
live vaccines, genetic vaccines or component vaccines. 

15 

Notwithstanding the highly pathogenic nature of HIV-1, there are some reports of long 
term survival of subjects infected with the virus (Learmont et al, 1992; Levy, 1993; 
Sheppard etal, 1993; Lifson et al 1991). It is not always clear, however, whether a 
benign course following HIV-1 infection is due to host factors, viral factors or other 
20 unknown factors. There are reports that most infected people have at least laboratory 
evidence of progressive immune system damage in the form of CD4+ cell loss (Lang 
et al, 1989) and defective immune responses (Clerici et al, 1989). 

Although simian monkeys have been used as an in vivo model for HIV and Simian 
25 Immunodeficiency Virus (SIV) infection, a major handicap in AIDS research has been 
the absence of suitable in vivo models to study the pathogenesis of the disease and, in 
particular, to study the viruses involved in benign infection. The need for a suitable in 
vivo model is heightened by the fact that results obtained in vitro cannot necessarily be 
extrapolated to what occurs in vivo. This was clearly observed by Mosier et al (1993) 
30 where conflicting results were obtained in animals compared to cell cultures. 
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Despite the absence of suitable in vivo models, considerable scientific research has been 
directed to attenuating fflV-1 strains by mutagenesis of the virus genome. Deletions in 
the nef gene have been implicated in attenuated strains of SIV and their use in providing 
5 protective effects in monkeys (Daniel et al, 1992). However, there are conflicting 
reports on the possible negative influence the nef gene product has on the rate or extent 
of virus replication (Terwilliger et al, 1986; Luciw et al, 1987; Niederman et al, 1989; 
Kim et al, 1989; Hammes et al, 1989). In fact, Kim et al (1989) found that nef Aid not 
affect HIV-1 replication or HIV-1 long terminal repeat (LTR)-driven CAT expression. 
10 Kestler III et al (1991) found that the nef gem is required for full pathogenic potential 
in SIV. However, such is the complexity of the HIV-1 group of viruses and the 
variability of immune responses between individuals let alone different species, that it 
is far from clear whether nef deleted strains of HIV-1 would behave similarly to nef 
deleted strains of SIV-I. There is a need, therefore, in order to investigate the possibility 
15 of. nef deleted HIV-1 strain as a vaccine candidate, to identify individuals infected with 
such modified viruses. 

Learmont et al (1992) reported that a cohort of five persons infected with blood products 
from a single HIV-1 infected donor have remained asymptomatic from up to about 10- 

20 14 years after infection. Subsequently, a sixth person has been identified as being part 
of the cohort. Both the donor and recipients were HIV-1 seropositive but with no 
indications of clinical symptoms of HIV-1 related disease and CD4+ cell number and 
P 2 -microglobulin levels have remained in the normal range. The identification of this 
cohort of benignly infected individuals provides a unique in vivo model in which the 

25 pathogenesis of HIV-1 infection can be studied at the clinical and molecular biological 
levels. 

However, it has not always possible using conventional isolation procedures to routinely 
and reproducibly isolate viral strains from the above mentioned donor or recipients 
30 which has frustrated efforts to investigate the cause of the asymptomatic individuals. 
In accordance with the present invention, methods have now been established to isolate 
viruses from the above individuals. It has been determined, in accordance with the 
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present invention, that the six individuals of the cohort are infected by non-pathogenic 
strains of HIV- 1 . Furthermore, the non-pathogenic strains of HIV- 1 carry one or more 
nucleotide mutations. The non-pathogenic strains of the present invention enable the 
generation of a range of therapeutic, diagnostic and targeting agents against HIV-1 
5 infection. The present invention also enables the attenuation of previously pathogenic 
strains of HIV- L 

Accordingly, one aspect of the present invention contemplates a non-pathogenic isolate 
of HIV-1 or a component, part, fragment or derivative thereof. 

10 

In a related embodiment, there is provided a novel isolate of HIV-1 or a component, 
part, fragment or derivative thereof wherein said HIV-1 isolate is capable of stimulating 
in a human or primate subject an immune response such as a humoral immune response 
to at least one HIV-1 glycoprotein such as but not limited to gp41-45, gpl20 and/or 
15 gpl60 while not substantially reducing in said human or primate subject proliferative 
responses and cytokine production to a mitogen, alloantigen and/or recall antigen 
compared to a healthy, non-infected human or primate subject. Preferably, the cytokine 
is IL-2. Preferably, the mitogen is ConA or PHA and the recall antigen is influenza or 
tetanus toxoid. Preferably, the HIV-1 isolate is non-pathogenic. 

20 

More particularly, the present invention relates to an isolated HIV-1 strain which: 

(i) is substantially non-pathogenic in human subjects; and 

(ii) carries one or more mutations in its genome resulting in the inability to direct 
synthesis of at least one pathogenic HIV-1 -derived polypeptide or protein. 

25 

Even more particularly, the present invention provides an isolated HIV-1 strain which: 

(i) is substantially non-pathogenic in human subjects; and 

(ii) carries a mutation in the nef gene and/or a long terminal repeat (LTR) region or 
in a functionally equivalent location in the HIV-1 genome. 
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Still even more particularly, the present invention is directed to an isolated virus which: 
(i) has a genome which is capable of hybridising under medium stringency 
conditions to complementary nucleic acid from a pathogenic strain of HIV-1; 
5 (ii) is substantially non-pathogenic in human subjects; 

(iii) carries one or more deletion mutations in a region of its genome corresponding 
to a nef gene in said pathogenic strain of HIV-1; and 

(iv) optionally carries a mutation in one or both LTR regions. 

10 In a related embodiment, there is provided an isolated virus which: 

(i) has a genome which is capable of hybridising under medium stringency 
conditions to complementary nucleic acid from a pathogenic strain of HIV- 1 ; 

(ii) is substantially non-pathogenic in human subjects; 

,(m) carries one or more deletion mutations in an LTR region of its genome; and 
15 (iv) optionally carries a mutation in a region corresponding to a nef gene in said 
pathogenic strain of HIV-1. 

In a further related embodiment, there is provided an isolated virus which: 

(i) has a genome which is capable of hybridising under medium stringency 
20 conditions to complementary nucleic acid from a pathogenic strain of HIV-1; 

(ii) is substantially non-pathogenic in human subjects; and 

(iii) carries one or more deletion mutations in a region of its genome corresponding 
to a region which contains nef coding sequences and LTR nucleotide sequences. 

25 In a particularly preferred embodiment, the present invention provides non-pathogenic 
HIV-1 isolate CI 8 deposited at the ECACC on 17 October, 1994 under Provisional 
Accession Number V941 01706. 



30 



In a related embodiment, the present invention provides non-pathogenic HIV-1 isolate 
C98 deposited at the ECACC on 31 October, 1994 under Provisional Accession Number 
V941031169. 
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In another embodiment, the present invention provides non-pathogenic HIV-1 isolate 
C54 deposited at ECACC on 14 February, 1995 under Provisional Accession No. . 

Although pathogenicity is a relative term, it is used herein in relation to the capacity of 
5 a strain of HIV-1 to induce AIDS or AIDS-related disorders in an individual over time. 
Accordingly, a "non-pathogenic" strain of HIV-1 is a strain which, at the clinical level, 
does not lead to the development of AIDS, at least within the median time of 6-10 years 
following infection with HIV-1. At the laboratory level, a non-pathogenic strain of 
HIV-1 is considered not to alter CD4+ cell counts or p 2 -microglobulin concentrations. 

10 In addition, a non-pathogenic strain of HIV-1 may not alter CD8+ and CD3+ cell counts 
and would not alter lymphocyte counts. CD4+;CD8+ ratios also remain unchanged 
relative to normal non-infected individuals. Furthermore, generally, a non-pathogenic 
strain of HIV-1 does not induce p24 antigenaemia. A non-pathogenic HIV-1 of the 
present invention is generally still infectious but individuals infected with the virus 

15 remain free of symptoms for at least 6-10 years after infection. - 

A laboratory classified non-pathogenic strain of HIV-1 may be determined at any time 
after infection. The term "non-pathogenic" is not to be considered as a strain that is 
never pathogenic under any conditions as this might depend on the host individual, the 

20 level of immune responsiveness in that individual and the extent or otherwise of other, 
for example, immune comprising disorders. Accordingly, a M non-pathogenic" HIV-1 
isolate of the present invention may also be considered a "low virulent" strain of the 
virus. A non-pathogenic strain of HIV-1 as contemplated herein may be isolated from 
an asymptomatic individual or may be derived from a pathogenic strain by mutation. 

25 Although the present invention is not to be limited to any particular pathogenic strain 
of HIV-1, for reference purposes, an example of a pathogenic strain is HIV-1 NL4-3 
strain as described by Myers et al (1992). 

The non-pathogenic nature of the HIV-1 of the present invention is conveniently 
30 evidenced by the cohort of seven individuals comprising one donor and six recipients 
which have remained free of symptoms or signs of HIV-1 infection for greater than the 
median time of 6-10 years. However, the individuals of the cohort are seropositive for 
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HIV-1 following infection with the virus as determined by Western blot analysis and 
genetic analysis (e.g. using PCR techniques). A seropositive individual is one showing 
reactivity to at least one HIV-1 glycoprotein (such as but not limited to gp 41-45, gpl20, 
gpl60) and at least three other virus-specific bands. 

5 

In accordance with the present invention, a non-pathogenic HIV-1 isolate is also a strain 
of HIV-1 which: 

(i) induces an immune response in a human or primate subject; and 

(ii) does not substantially reduce proliferative responses or cytokine production to a 
10 mitogen, alloantigen and/or recall antigen relative to a healthy, non-infected 

subject. 

Preferably, the immune response such as to a glycoprotein, for example gp41-45, gpl20 
and/or gpl60. Preferably, the cytokine monitored is an interleukin, such as IL-2. 
15 Preferably, the recall antigen is influenza or tetanus toxoid. A non-pathogenic HIV-1 
isolate is also one which: 

(iii) does not substantially alter proliferative responses or cytokine production to 
allorgeneic mononuclear cells. 



20 The genomes or complementary DNA therefrom of the non-pathogenic HIV-1 isolates 
of the present invention are capable of hybridising under medium stringency conditions 
to the corresponding genome or complementary DNA of a pathogenic strain of HIV-1 
(e.g. HIV-1 strain NL4-3). The ability to hybridise to a pathogenic strain of HIV-1 only 
applies to a comparison of the entire genome/complementary DNA of a non-pathogenic 

25 strain or a fragment which includes genetic material corresponding to a region in the 
genome 3' of the nef gene in a pathogenic strain of HIV-1. 



For the purposes of reference only, a suitable genomic nucleotide sequence from a 
pathogenic HIV-1 strain is set forth in SEQ ID NO: 1 from HIV-1 strain NL4-3 (Myers 
30 et ah 1992): 

1 TGGAAGGGCTAATTTGGTCCCAAAAAAGACAAGAGATCCTTGATCTGTGG 
51 ATCTACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGG 
101 GCCAGGGATCAGATATCCACTGACCTTTGGATGGTGCTTCAAGTTAGTAC 
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151 CAGTTGAACCAGAGCAAGTAGAAGAGGCCAAATAAGGAGAGAAGAACAGC 
201 TTGTTACACCCTATGAGCCAGCATGGGATGGAGGACCCGGAGGGAGAAGT 

2 51 ATTAGTGTGGAAGTTTGACAGCCTCCTAGCATTTCGTCACATGGCCCGAG 

3 01 AGCTGCATCCGGAGTACTACAAAGACTGCTGACATCGAGCTTTCTACAAG 
5 3 51 GGACTTTCCGCTGGGGACTTTCCAGGGAGGTGTGGCCTGGGCGGGACTGG 

4 01 GGAGTGGCGAGCCCTCAGATGCTACATATAAGCAGCTGCTTTTTGCCTGT 
451 ACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGCTA 
501 ACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTGAGTGCTCA 

5 51 AAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAGAGATCCCTC 
10 6 01 AGACCCTTTTAGTCAGTGTGGAAAATCTCTAGCAGTGGCGCCCGAACAGG 

651 GACTTGAAAGCGAAAGTAAAGCCAGAGGAGATCTCTCGACGCAGGACTCG 
701 GCTTGCTGAAGCGCX3CACGGCAAGAGGCGAGGGGCGGCGACTGGTGAGTA 
751 CGCCAAAAATTTTGAGTAGCGGAGGCTAGAAGGAGAGAGATGGGTGCGAG 
801 AGCGTCGGTATTAAGCGGGGGAGAATTAGATAAATGGGAAAAAATTCGGT 
15 851 TAAGGCCAGGGGGAAAGAAACAATATAAACTAAAACATATAGTATGGGCA 

901 AGCAGGGAGCTAGAACGATTCGCAGTTAATCCTGGCCTTTTAGAGACATC 
951 AGAAGGCTGTAGACAAATACTGGGACAGCTACAACCATCCCTTCAGACAG 
10 01 GATCAGAAGAACTTAGATCATTATATAATACAATAGCAGTCCTCTATTGT 
1051 GTGCATCAAAGGATAGATGTAAAAGACACCAAGGAAGCCTTAGATAAGAT 
20 1101 AGAGGAAGAGCAAAACAAAAGTAAGAAAAAGGCACAGCAAGCAGCAGCTG 

1151 ACACAGGAAACAACAGCCAGGTCAGCCAAAATTACCCTATAGTGCAGAAC 

12 01 CTCCAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGC 
1251 ATGGGTAAAAGTAGTAGAAGAGAAGGCTTTCAGCCCAGAAGTAATACCCA 

13 01 TGTTTTCAGCATTATCAGAAGGAGCCACCCCACAAGATTTAAATACCATG 
25 13 51 CTAAACACAGTGGGGGGACATCAAGCAGCCATGCAAATGTTAAAAGAGAC 

14 01 CATCAATGAGGAAGCTGCAGAATGGGATAGATTGCATCCAGTGCATGCAG 
1451 GGCCTATTGCACCAGGCCAGATGAGAGAACCAAGGGGAAGTGACATAGCA 
1501 GGAACTACTAGTACCCTTCAGGAACAAATAGGATGGATGACACATAATCC 
1551 ACCTATCCCAGTAGGAGAAATCTATAAAAGATGGATAATCCTGGGATTAA 

30 1601 ATAAAATAGTAAGAATGTATAGCCCTACCAGCATTCTGGAGATAAGACAA 

1651 GGACCAAAGGAACCCTTTAGAGACTATGTAGACCGATTCTATAAAACTCT 
1701 AAGAGCCGAGCAAGCTTCACAAGAGGTAAAAAATTGGATGACAGAAACCT 
1751 TGTTGGTCCAAAATGCGAACCCAGATTGTAAGACTATTTTAAAAGCATTG 
1801 GGACCAGGAGCGACACTAGAAGAAATGATGACAGCATGTCAGGGAGTGGG 

35 18 51 GGGACCCGGCCATAAAGCAAGAGTTTTGGCTGAAGCAATGAGCCAAGTAA 

1901 CAAATCCAGCTACCATAATGATACAGAAAGGCAATTTTAGGAACCAAAGA 
1951 AAGACTGTTAAGTGTTTCAATTGTGGCAAAGAAGGGCACATAGCCAAAAA 
2001 TTGCAGGGCCCCTAGGAAAAAGGGCTGTTGGAAATGTGGAAAGGAAGGAC 
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2 051 ACCAAATGAAAGATTGTACTGAGAGACAGGCTAATTTTTTAGGGAAGATC 
2101 TGGCCTTCCCACAAGGGAAGGCCAGGGAATTTTCTTCAGAGCAGACCAGA 
2151 GCCAACAGCCCCACCAGAAGAGAGCTTCAGGTTTGGGGAAGAGACAACAA 
.2201 CTCCCTCTCAGAAGCAGGAGCCGATAGACAAGGAACTGTATCCTTTAGCT 
5 2251 TCCCTCAGATCACTCTTTGGCAGCGACCCCTCGTCACAATAAAGATAGGG 

23 01 GGGCAATTAAAGGAAGCTCTATTAGATACAGGAGCAGATGATACAGTATT 

23 51 AGAAGAAATGAATTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGAA 

24 01 TTGGAGGTTTTATCAAAGTAGGACAGTATGATCAGATACTCATAGAAATC 
24 51 TGCGGACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAA 

10 2 501 CATAATTGGAAGAAATCTGTTGACTCAGATTGGCTGCACTTTAAATTTTC 

2 551 C C ATT AGT C CT ATTG AGACTGT AC C AGT AAAATT AAAG CCAGG AATGG AT 
2 601 GGCCCAAAAGTTAAACAATGGCCATTGACAGAAGAAAAAATAAAAGCATT 
2 651 AGTAGAAATTTGTACAGAAATGGAAAAGGAAGGAAAAATTTCAAAAATTG 
2701 GGCCTGAAAATCCATACAATACTCCAGTATTTGCCATAAAGAAAAAAGAC 

15 2751 AGTACTAAATGGAGAAAATTAGTAGATTTCAGAGAACTTAATAAGAGAAC 

2 8 01 TCAAGATTTCTGGGAAGTTCAATTAGGAATACCACATCCTGCAGGGTTAA 
2 8 51 AACAGAAAAAATCAGTAACAGTACTGGATGTGGGCGATGCATATTTTTCA 
2 901 GTTCCCTTAGATAAAGACTTCAGGAAGTATACTGCATTTACCATACCTAG 

2 951 TATAAACAATGAGACACCAGGGATTAGATATCAGTACAATGTGCTTCCAC 
20 3 001 AGGGATGGAAAGGATCACCAGCAATATTCCAGTGTAGCATGACAAAAATC 

3 051 TTAGAGCCTTTTAGAAAACAAAATCCAGACATAGTCATCTATCAATACAT 
3101 GGATGATTTGTATGTAGGATCTGACTTAGAAATAGGGCAGCATAGAACAA 
3151 AAATAGAGGAACTGAGACAACATCTGTTGAGGTGGGGATTTACCACACCA 
32 01 GACAAAAAACATCAGAAAGAACCTCCATTCCTTTGGATGGGTTATGAACT 

25 3 251 CCATCCTGATAAATGGACAGTACAGCCTATAGTGCTGCCAGAAAAGGACA 

3 3 01 GCTGGACTGTCAATGACATACAGAAATTAGTGGGAAAATTGAATTGGGCA 
3 3 51 AGTCAGATTTATGCAGGGATTAAAGTAAGGCAATTATGTAAACTTCTTAG 
34 01 GGGAAC CAAAG CACTAACAG AAGT AGT AC CA CT AACAGAAGAAG C AG AG C 
34 51 TAGAACTGGCAGAAAACAGGGAGATTCTAAAAGAACCGGTACATGGAGTG 

30 3 5 01 TATTATGACCCATCAAAAGACTTAATAGCAGAAATACAGAAGCAGGGGCA 

3 551 AGGCGAATGGACATATGAAATTTATCAAGAGCCATTTAAAAATCTGA 
3 6 01 CAGGAAAATATGCAAGAATGAAGGGTGCCCACACTAATGATGTGAAACAA 
3 651 TTAACAGAGGCAGTACAAAAAATAGCCACAGAAAGCATAGTAATATGGGG 
3 701 AAAGACTCCTAAATTTAAATTACCGATACAAAAGGAAACATGGGAAGCAT 

35 3 751 GGTGGACAGAGTATTGGCAAGCCACCTGGATTCCTGAGTGGGAGTTTGTC 

3 8 01 AATACCCCTCCCTTAGTGAAGTTATGGTACCAGTTAGAGAAAGAACCCAT 
3 851 AATAGGAGCAGAAACTTTCTATGTAGATGGGGCAGCCAATAGGGAAACTA 
3 901 AATTAGGAAAAGCAGGATATGTAACTGACAGAGGAAGACAAAAAGTTGTC 
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3 951 CCCCTAACGGACACAACAAATCAGAAGACTGAGTTACAAGCAATTCATCT 

4 0 01 AGCTTTGCAGGATTCGGGATTAGAAGTAAACATAGTGACAGACTCACAAT 
4 051 ATGCATTGGGAATCATTCAAGCACAACCAGATAAGAGTGAATCAGAGTTA 
4101 GTCAGTCAAATAATAGAGCAGTTAATAAAAAAGGAAAAAGTCTACCTGGC 
4151 ATGGGTACCAGCACACAAAGGAATTGGAGGAAATGAACAAGTAGATGGGT 
4201 TGGTCAGTGCTGGAATCAGGAAAGTAGTATTTTTAGATGGAATAGATAAG 
4251 GCCCAAGAAGAACATGAGAAATATCACAGTAATTGGAGAGCAATGGCTAG 

43 01 TGATTTTAACCTACCACCTGTAGTAGCAAAAGAAATAGTAGCCAGCTGTG 
4351 ATAAATGTCAGCTAAAAGGGGAAGCCATGCATGGACAAGTAGACTGTAGC 

44 01 CCAGGAATATGGCAGCTAGATTGTACACATTTAGAAGGAAAAGTTATCTT ~ 
44 51 GGTAGCAGTTCATGTAGCCAGTGGATATATAGAAGCAGAAGTAATTCCAG 
4501 CAGAGACAGGGCAAGAAACAGCATACTTCCTCTTAAAATTAGCAGGAAGA 
4551 TGGCCAGTAAAAACAGTACATACAGACAATGGCAGCAATTTCACCAGTAC 
4601 TACAGTTAAGGCCGCCTGTTGGTGGGCGGGGATCAAGCAGGAATTTGGCA 
4651 TTCCCTACAATCCCCAAAGTCAAGGAGTAATAGAATCTATGAATAAAGAA 

4 701 TTAAAGAAAATTATAGGACAGGTAAGAGATCAGGCTGAACATCTTAAGAC 
4 751 AGCAGTACAAATGGCAGTATTCATCCACAATTTTAAAAGAAAAGGGGGGA 
4 801 TTGGGGGGTACAGTGCAGGGGAAAGAATAGTAGACATAATAGCAACAGAC 
4 851 ATACAAACTAAAGAATTACAAAAACAAATTACAAAAATTCAAAATTTTCG 
4 901 GGTTTATTACAGGGACAGCAGAGATCCAGTTTGGAAAGGACCAGCAAAGC 

4 951 TCCTCTGGAAAGGTGAAGGGGCAGTAGTAATACAAGATAATAGTGACATA 

5 001 AAAGTAGTGCCAAGAAGAAAAGCAAAGATCATCAGGGATTATGGAAAACA 
5051 GATGGCAGGTGATGATTGTGTGGCAAGTAGACAGGATGAGGATTAACACA 
5101 TGGAAAAGATTAGTAAAACACCATATGTATATTTCAAGGAAAGCTAAGGA 
5151 CTGGTTTTATAGACATCACTATGAAAGTACTAATCCAAAAATAAGTTCAG 
5201 AAGTACACATCCCACTAGGGGATGCTAAATTAGTAATAACAACATATTGG 
5251 GGTCTGCATACAGGAGAAAGAGACTGGCATTTGGGTCAGGGAGTCTGCAT 
5301 AGAATGGAGGAAAAAGAGATATAGCACACAAGTAGACCCTGACCTAGCAG 
5351 ACCAACTAATTCATCTGCACTATTTTGATTGTTTTTCAGAATCTGCTATA 
54 01 AGAAATACCATATTAGGACGTATAGTTAGTCCTAGGTGTGAATATCAAGC 
5451 AGGACATAACAAGGTAGGATCTCTACAGTACTTGGCACTAGCAGCATTAA 
5501 TAAAACCAAAACAGATAAAGCCACCTTTGCCTAGTGTTAGGAAACTGACA 
5 551 GAGGACAGATGGAACAAGCCCCAGAAGACCAAGGGCCACAGAGGGAGCCA 
5601 TACAATGAATGGACACTAGAGCTTTTAGAGGAACTTAAGAGTGAAGCTGT 
5651 TAGACATTTTCCTAGGATATGGCTCCATAACTTAGGACAACATATCTATG 
5701 AAACTTACGGGGATACTTGGGCAGGAGTGGAAGCCATAATAAGAATTCTG 
5751 CAACAACTGCTGTTTATCCATTTCAGAATTGGGTGTCGACATAGCAGAAT 
5801 AGGCGTTACTCGACAGAGGAGAGCAAGAAATGGAGCCAGTAGATCCTAGA 
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5 8 51 CTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGGTTGTACCAA 
5 901 TTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATGACAA 

5 951 AAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGA 

6 001 GCTCATCAGAACAGTCAGACTGATCAAGCTTCTCTATCAAAGCAGTAAGT 
5 6 051 AGTACATGTAATGCAACCTATAATAGTAGCAATAGTAGCATTAGTAGTAG 

6101 CAATAATAATAGCAATAGTTGTGTGGTCCATAGTAATCATAGAATATAGG 
6151 AAAATATTAAGACAAAGAAAAATAGACAGGTTAATTGATAGACTAATAGA 
62 01 AAGAGCAGAAGACAGTGGCAATGAGAGTGAAGGAGAAGTATCAGCACTTG 
6251 TGGAGATGGGGGTGGAAATGGGGCACCATGCTCCTTGGGATATTGATGAT 

10 63 01 CTGTAGTGCTACAGAAAAATTGTGGGTCACAGTCTATTATGGGGTACCTG 

6351 TGTGGAAGGAAGCAACCACCACTCTATTTTGTGCATCAGATGCTAAAGCA 
64 01 TATGATACAGAGGTACATAATGTTTGGGCCACACATGCCTGTGTACCCAC 
64 51 AGACCCCAACCCACAAG7LAGTAGTATTGGTAAATGTGACAGAAAATTTTA 
6501 ACATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGT 

15 6551 TTATGGGATCAAAGCCTAAAGCC^TGTGTAAAATTAACCCCACTCTGTGT 

66 01 TAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACCAATAGTAGTA 
6651 GCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAAT 
6 701 ATCAGCACAAGCATAAGAGATAAGGTGCAGAAAGAATATGCATTCTTTTA 
6 751 TAAACTTGATATAGTACCAATAGATAATACCAGCTATAGGTTGATAAGTT 

20 6801 GTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCCTTTGAGCCA 

6 8 51 ATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAA 
6 901 TAATAAGACGTTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTAC 
6 951 AATGTACACATGGAATCAGGCCAGTAGTATCAACTCAACTGCTGTTAAAT 
70 01 GGCAGTCTAGCAGAAGAAGATGTAGTAATTAGATCTGCCAATTTCACAGA 

25 7051 GAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATT 

7101 GTAGAAGACCCAACAAGAATACAAGAAAAAGTATCCGTATCCAGAGGGGA 
7151 CCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGAAATATGAGACAAGC 
7201 ACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAG 

72 51 CTAGCAAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAG 
30 73 01 CAATCCTCAGGAGGGGACCCAGAAATTGTAACGCACAGTTTTAATTGTGG 

73 51 AGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGTACTTGGT 

74 01. TTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGAC 
74 51 ACAATCACACTCCCATGCAGAATAAAACAATTTATAAACATGTGGCAGGA 
7501 AGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGATGTT 

35 7551 CATCAAATATTACTGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 

7601 AATGGGTCCGAGATCTTCAGACCTGGAGGAGGCGATATGAGGGACAATTG 
7651 GAGAAGTGAATTATATAAATATAAAGTAGTAAAAATTGAACCATTAGGAG 
7701 TAG CAC C CAC CAAGGCAAAG AG AAG AGTGGTG CAG AG AG AAAAAAG AG CA 
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7751 GTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCAC 
7 8 01 TATGGGCTGCACGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGT 
7851 CTGATATAGTGCAGCAGCAGAACAATTTGCTGAGGGCTATTGAGGCGCAA 

7 9 01 CAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAACAGCTCCAGGCAAG 
5 7951 AATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTT 

8 0 01 GGGGTTGCTCTGGAAAACTCATTTGCAGCACTGCTGTGCCTTGGAATGCT 
8 0 51 AGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATAACATGACCTGGAT 
8101 GGAGTGGGACAGAGAAATTAACAATTACACAAGCTTAATACACTCCTTAA 
8151 TTGAAGAATCGCAAAACCAGCAAGAAAAGAATGAACAAGAATTATTGGAA 

10 8201 TTAGATAAATGGGCAAGTTTGTGGAATTGGTTTAACATAACAAATTGGCT 

8251 GTGGTATATAAAATTATTCATAATGATAGTAGGAGGCTTGGTAGGTTTAA 

83 01 GAATAGTTTTTGCTGTACTTTCTATAGTGAATAGAGTTAGGCAGGGATAT 
8351 TCACCATTATCGTTTCAGACCCACCTCCCAATCCCGAGGGGACCCGACAG 

84 01 GCCCGAAGGAATAGAAGAAGAAGGTGGAGAGAGAGACAGAGACAGATCCA 
15 8451 TTCGATTAGTGAACGGATCCTTAGCACTTATCTGGGACGATCTGCGGAGC 

8501 CTGTGCCTCTTCAGCTACCACCGCTTGAGAGACTTACTCTTGATTGTAAC 
8551 GAGGATTGTGGAACTTCTGGGACGCAGGGGGTGGGAAGCCCTCAAATATT 
8601 GGTGGAATCTCCTACAGTATTGGAGTCAGGAACTAAAGAATAGTGCTGTT 

8 651 AACTTGCTCAATGCCACAGCCATAGCAGTAGCTGAGGGGACAGATAGGGT 
20 8 701 TATAGAAGTATTACAAGCAGCTTATAGAGCTATTCGCCACATACCTAGAA 

8751 GAATAAGACAGGGCTTGGAAAGGATTTTGCTATAAGATGGGTGGCAAGTG 
88 01 GTCAAAAAGTAGTGTGATTGGATGGCCTGCTGTAAGGGAAAGAATGAGAC 
88 51 GAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGTATCTCGAGACCTAGAA 
8901 AAACATGGAGCAATCACAAGTAGCAATACAGCAGCTAACAATGCTGCTTG 
25 8 951 TGCCTGGCTAGAAGCACAAGAGGAGGAAGAGGTGGGTTTTCCAGTCACAC 

9 0 01 CTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGC 
9 0 51 CACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAAAG 
9101 AAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCC 
9151 CTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACC 

30 92 01 TTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGA 

92 51 GGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATG 

93 01 GAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGC 

93 51 CTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAA 

94 01 CTGCTGACATCGAGCTTGCTACAAGGGACTTTCCGCTGGGGACTTTCCAG 
35 94 51 GGAGGCGTGGCCTGGGCGGGACTGGGGAGTGGCGAGCCCTCAGATGCTGC 

9501 ATATAAGCAGCTGCTTTTTGCCTGTACTGGGTCTCTCTGGTTAGACCAGA 
9551 TCTGAGCCTGGGAGCTCTCTGGCTAACTAGGGAACCCACTGCTTAAGCCT 
96 01 CAATAAAGCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGTG 
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.9651. TGACTCTGGTAACTAGAGATCCCTCAGACCCTTTTAGTCAGTGTGGAAAA 
97 01 TCTCTAGCA 

However, for the purposes of comparing the nucleotide sequences of non-pathogenic 
5 HIV-1 strains including the ability to hybridise to a reference strain, the present 
invention extends to a genomic nucleotide sequence from any pathogenic strain of HI V- 
1. 

Accordingly, in a particularly preferred embodiment, there is provided a viral isolate 
10 which: 

(i) carries a genome which is capable of hybridising under medium stringency 
conditions to SEQ ID NO: 1 or a complementary form thereof or an analogous 
sequence from another pathogenic strain of HIV-1; and 

(ii) carries a deletion mutation in a region corresponding to the we/gene and/of in 
15 an LTR region. Generally, such an HIV-1 isolate is non-pathogenic as 

hereinbefore defined. 

In a related embodiment, there is provided an isolated virus which: 

(i) has a genome which is capable of hybridising under medium stringency 
20 conditions to complementary nucleic acid from a pathogenic strain of HIV-1; 

and 

(ii) carries one or more deletion mutations in a region of its genome corresponding 
to a region which contains nef coding sequences and LTR nucleotide sequences. 



25 For the purposes of defining the level of stringency, reference can conveniently be made 
to Maniatis et al (1982) at pages 387-389 which is herein incorporated by reference 
where the washing steps disclosed are considered high stringency. A low stringency is 
defined herein as being in 1-3X SSC/0. 1-0.5% w/v SDS at 37-50°C for 2-3 hours. 
Depending on the source and concentration of nucleic acid involved in the hybridisation, 

30 alternative conditions of stringency may be employed such as medium stringent 
conditions which are considered herein to be 0.1-1X SSC/0.25-0.5% w/v SDS at > 45°C 
for 2-3 hours or high stringent conditions considered herein to be 0.1 -IX SSC/0. 1% w/v 
SDS at 60°C for 1-3 hours. 
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In a particularly preferred embodiment of the present invention, the non-pathogenic 
strain of HIV- 1 carries a mutation in the nef gene and/or LTR region of the genome. 

A "mutation" is considered herein to include a single or multiple nucleotide substitution, 
5 deletion and/or addition. Most preferred mutations are single or multiple deletions of 
at least one, most preferably at least ten and even more preferably at least twenty 
contiguous nucleotides from a region corresponding to the nef gene and/or the LTR 
region. When the non-pathogenic virus carries a mutation in the LTR region, this 
generally occurs 5' of the Spl sites. 

10 

According to a preferred aspect of the present invention, there is provided a viral isolate 
which: 

(i) is reactive to antibodies to a glycoprotein from HIV-1 such as at least one of 
gp41-45, gpl20 and/or gpl60; 

1 5 (ii) is substantially non-pathogenic in human subjects; and 

(iii) carries a deletion mutation of at least ten nucleotides in a region corresponding 
to the nef gene and/or LTR region of a pathogenic strain of HIV-1. 

In another embodiment, there is provided a viral isolate which: 
20 (i) is capable of inducing an immune response to at least one of gag pol and/or env; 

(ii) is substantially non-pathogenic in human subjects; and 

(iii) carries a deletion mutation of at least ten nucleotides in a region corresponding 
to the we/ gene and/or LTR region of a pathogenic strain of HIV-1. 

25 Preferably, in respect of the latter embodiment, the immune response is an antibody or 
a cell mediated response. In a most preferred embodiment, the immune response is a 
humoral immune response. 

The nucleotide sequence of the nef gene in HIV-1 NL4-3 is defined in SEQ ID NO: 
30 650: 

ATGGGTGGCAAGTGGTCAAAAAGTAGTGTGATTGGATGGCCTGCTGTAAGGGAAAGAAT 
GAGACGAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGTATCTCGAGACCTAGAAAAAC 
ATGGAGCAATCACAAGTAGCAATACAGCAGCTAACAATGCTGCTTGTGCCTGGCTAGAA 



WO 95/21912 

- 16 - 

GCACAAGAGGAGGAAGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAAT 
GACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAG 
GGCTAATTCACTCCCAAAGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAA 
GGGTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGAC 
5 CTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATA 
AAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATGACCCTGAG 
AGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGAGA 

GCTGCATCCGGAGTACTTCAAGAACTGCTGA 

10 The present invention extends to any or all single or multiple nucleotide deletions to a 
contiguous series of at least ten nucleotides from the He/gene which render the strain 
avirulent. The deletions may encompass the entire gene or parts thereof and may 
represent a single deletion or two or more deletions. Put in alternative terms, the non- 
pathogenic fflV-1 isolates of the present invention comprise a nucleotide sequence at the 

15 corresponding «e/gene region non-identifiable to SEQ ID NO: 650, said non-identity 
comprising at least 5%, more preferably at least 10% and even more preferably at least 
20% variation thereon. 

In a preferred embodiment, therefore, the present invention contemplates a viral isolate 
20 which: 

(i) is reactive to antibodies to a glycoprotein from HIV-1 such as at least one of 
gp41-45, gpl20 and/or gpl60; 

(ii) carries a genome or a part or fragment thereof capable of hybridising under 
medium stringency conditions to a nucleotide sequence as set forth in SEQ ID 

25 NO: 1 or a complementary form thereof or an analogous sequence from another 

pathogenic strain of HIV-1; 

(iii) carries a deletion of at least ten nucleotides in a region corresponding to the nef 
gene in HIV-1 NL4-3; and 

wherein said deletion encompasses one or more of the following decanucleotides from 
30 the ne/gene of HIV- 1 NL4-3 or corresponding sequences from another pathogenic strain 
of HIV-1: 

ATGGGTGGCA ( SEQ ID NO: 2); TGGGTGGCAA ( SEQ ID NO : 3); 
GGGTGGCAAG (SEQ ID NO: 4); GGTGGCAAGT (SEQ . ID NO : 5) ; 
GTGGCAAGTG ( SEQ ID NO: 6); TGGCAAGTGG ( SEQ ID NO: 7); 
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GGCAAGTGGT ( SEQ ID NO: 8); 
CAAGTGGTCA ( SEQ ID NO: 10) ; 
AGTGGTCAAA ( SEQ ID NO: 12) ; 
TGGTCAAAAA ( SEQ ID NO: 14); 
5 GTCAAAAAGT (SEQ ID NO: 16); 
CAAAAAGTAG (SEQ ID NO: 18); 
AAAAGTAGTG (SEQ ID NO: 20); 
AAGTAGTGTG ( SEQ ID NO: 22) ; 
GTAGTGTGAT (SEQ ID NO: 24); 
10 AGTGTGATTG ( SEQ ID NO: 26); 
TGTGATTGGA ( SEQ ID NO: 28); 
TGATTGGATG ( SEQ ID NO: 30); 
ATTGGATGGC ( SEQ ID NO: 32); 
TGGATGGCCT ( SEQ ID NO: 34); 
15 GATGGCCTGC (SEQ ID NO: 36); 
TGGCCTGCTG (SEQ ID NO: 38); 
GCCTGCTGTA (SEQ ID NO : 40); 
CTGCTGTAAG (SEQ ID NO: 42); 
GCTGTAAGGG ( SEQ ID NO: 44); 
20 TGTAAGGGAA ( SEQ ID NO: 46); 
TAAGGGAAAG ( SEQ ID NO : 48); 
AGGGAAAGAA ( SEQ ID NO : 50); 
GGAAAGAATG ( SEQ ID NO : 52); 
AAAGAATGAG ( SEQ ID NO : 54); 
25 AGAATGAGAC ( SEQ ID NO : 56); 
AATGAGACGA (SEQ ID NO: 58); 
TGAGACGAGC ( SEQ ID NO : 60); 
AGACGAGCTG (SEQ ID NO : 62); 
ACGAGCTGAG (SEQ ID NO : 64); 
30 GAGCTGAGCC (SEQ ID NO : 66); 
GCTGAGCCAG ( SEQ ID NO: 68); 
TGAGCCAGCA ( SEQ ID NO: 70) ; 
AGCCAGCAGC ( SEQ ID NO : 72); 
CCAGCAGCAG ( SEQ ID NO: 74); 
35 AGCAGCAGAT ( SEQ ID NO : 76); 
CAGCAGATGG (SEQ ID NO : 78) ; 
GCAGATGGGG ( SEQ ID NO: 80); 
AGATGGGGTG (SEQ ID NO : 82); 
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GCAAGTGGTC (SEQ ID NO: 9); 
AAGTGGTCAA (SEQ ID NO: 11) 
GTGGTCAAAA ( SEQ ID NO: 13) 
GGTCAAAAAG ( SEQ ID NO: 15) 
TCAAAAAGTA ( SEQ ID NO: 17) 
AAAAAGTAGT (SEQ ID NO: 19) 
AAAGTAGTGT ( SEQ ID NO : 21) 
AGTAGTGTGA (SEQ ID NO: 23) 
TAGTGTGATT (SEQ ID NO: 25) 
GTGTGATTGG (SEQ ID NO: 27) 
GTGATTGGAT ( SEQ ID NO: 29) 
GATTGGATGG ( SEQ ID NO: 31) 
TTGGATGGCC (SEQ ID NO: 33) 
GGATGGCCTG ( SEQ ID NO: 35) 
ATGGCCTGCT (SEQ ID NO: 37) 
GGCCTGCTGT ( SEQ ID NO : 3 9), 
CCTGCTGTAA (SEQ ID NO: 41) , 
TGCTGTAAGG (SEQ ID NO : 43), 
CTGTAAGGGA ( SEQ ID NO: 45); 
GTAAGGGAAA ( SEQ ID NO: 47); 

AAGGGAAAGA ( SEQ ID NO: 49); 
GGGAAAGAAT (SEQ ID NO : 51); 

GAAAGAATGA ( SEQ ID NO: 53); 

AAGAATGAGA ( SEQ ID NO : 55); 

GAATGAGAGG ( SEQ ID NO : 57) ; 

ATGAGACGAG (SEQ ID NO : 59); 

GAGACGAGCT ( SEQ ID NO: 61); 

GACGAGCTGA ( SEQ ID NO : 63); 

CGAGCTGAGC (SEQ ID NO : 65) ; 

AGCTGAGCCA ( SEQ ID NO: 67) ; 

CTGAGCCAGC (SEQ ID NO: 69); 

GAGCCAGCAG ( SEQ ID NO: 71); 

GCCAGCAGCA ( SEQ ID NO: 73) ; 

CAGCAGCAGA ( SEQ ID NO : 75); 

GCAGCAGATG ( SEQ ID NO : 77); 

AGCAGATGGG ( SEQ ID NO : 79); 

CAGATGGGGT ( SEQ ID NO : 81); 

GATGGGGTGG ( SEQ ID NO: 83); 
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ATGGGGTGGG (SEQ ID NO: 
GGGGTGGGAG (SEQ ID NO : 
GGTGGGAGCA (SEQ ID NO: 
TGGGAGCAGT (SEQ ID NO: 
5 GGAGCAGTAT (SEQ ID NO: 
AGCAGTATCT (SEQ ID NO: 
CAGTATCTCG (SEQ ID NO: 
GTATCTCGAG (SEQ ID NO: 
ATCTCGAGAC (SEQ ID NO: 
10 CTCGAGACCT (SEQ ID NO: 
CGAGACCTAG (SEQ ID NO: 
AGACCTAGAA (SEQ ID NO: 
ACCTAGAAAA (SEQ ID NO: 
CTAGAAAAAC (SEQ ID NO: 
15 AGAAAAACAT ( SEQ ID NO: 
AAAAACATGG (SEQ ID NO: 
AAACATGGAG (SEQ ID NO: 
ACATGGAGCA ( SEQ ID NO: 
ATGGAGCAAT (SEQ ID NO: 
20 GGAGCAATCA ( SEQ ID NO: 
AGCAATCACA ( SEQ ID NO: 
CAATCAGAAG (SEQ ID NO: 
ATCACAAGTA ( SEQ ID NO: 
CACAAGTAGC ( SEQ ID NO: 
25 CAAGTAGCAA ( SEQ ID NO: 
AGTAGCAATA ( SEQ ID NO : 
TAGCAATACA ( SEQ ID NO: 
GCAATACAGC ( SEQ ID NO: 
AATACAGCAG ( SEQ ID NO: 
30 TACAGCAGCT ( SEQ ID NO: 
CAGCAGCTAA ( SEQ ID NO: 
GCAGCTAACA ( SEQ ID NO: 
AGCTAACAAT (SEQ ID NO: 
CTAACAATGC ( SEQ ID NO: 
35 AACAATGCTG ( SEQ ID NO: 
CAATGCTGCT (SEQ ID NO 
ATGCTGCTTG ( SEQ ID NO 
GCTGCTTGTG ( SEQ ID NO 
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84); TGGGGTGGGA ( SEQ ID NO : 85) 
86); GGGTGGGAGC ( SEQ ID NO : 87) 
88); GTGGGAGCAG ( SEQ ID NO : 89) 
90); GGGAGCAGTA (SEQ ID NO : 91) 
92); GAGCAGTATC (SEQ ID NO: 93) 
94); GCAGTATCTC ( SEQ ID NO: 95) 
96); AGTATCTCGA (SEQ ID NO : 97) 
98) ; TATCTCGAGA ( SEQ ID NO: 99) 
100); TCTCGAGACC (SEQ ID NO : 101 
102); TCGAGACCTA (SEQ ID NO: 103) 
104); GAGACCTAGA (SEQ ID NO: 105) 
106) ; GACCTAGAAA ( SEQ ID NO: 107) 
108); CCTAGAAAAA ( SEQ ID NO: 109) 
110); TAGAAAAACA ( SEQ ID NO: 111) 
112); GAAAAACATG ( SEQ ID NO: 113) 
114); AAAACATGGA ( SEQ ID NO: 115) 
116); AACATGGAGC (SEQ ID NO: 117) 
118); CATGGAGCAA (SEQ ID NO: 119) 
120); TGGAGCAATC (SEQ ID NO: 121) 
122); GAGCAATCAC ( SEQ ID NO: 123) 
124); GCAATCACAA (SEQ ID NO: 125) 
126); AATCACAAGT (SEQ ID NO: 127) 
128); TCACAAGTAG (SEQ ID NO: 129) 
130); ACAAGTAGCA ( SEQ ID NO : 131) 
132); AAGTAGCAAT ( SEQ ID NO: 133) 
134); GTAGCAATAC (SEQ ID NO: 13 5) 
136); AGCAATACAG ( SEQ ID NO: 137) 
138) ; CAATACAGCA ( SEQ ID NO: 139) 
140); ATACAGCAGC ( SEQ ID NO: 141) 
142); ACAGCAGCTA ( SEQ ID NO: 143) 
144); AGCAGCTAAC ( SEQ ID NO: 145) 
146) ; CAGCTAACAA ( SEQ ID NO: 147) 
148) ; GCTAACAATG ( SEQ ID NO : 14 9) 
150); TAACAATGCT ( SEQ ID NO: 151) 
152); ACAATGCTGC ( SEQ ID NO: 153) 
154); AATGCTGCTT (SEQ ID NO: 155) 
: 156); TGCTGCTTGT ( SEQ ID NO: 157) 
: 158); CTGCTTGTGC (SEQ ID NO: 159) 
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TGCTTGTGCC (SEQ ID NO : 160); 
CTTGTGCCTG (SEQ ID NO: 162); 
TGTGCCTGGC (SEQ ID NO: 164); 
TGCCTGGCTA ( SEQ ID NO: 166); 
5 CCTGGCTAGA ( SEQ ID NO : 168); 
TGGCTAGAAG (SEQ ID NO: 170); 
GCTAGAAGCA ( SEQ ID NO: 172); 
TAGAAGCACA ( SEQ ID NO: 174); 
GAAGCACAAG ( SEQ ID NO: 176); 
10 AGCACAAGAG ( SEQ ID NO: 178); 
CACAAGAGGA (SEQ ID NO: 180); 
CAAGAGGAGG (SEQ ID NO : 182); 
AGAGGAGGAA ( SEQ ID NO: 184); 
AGGAGGAAGA ( SEQ ID NO : 186); 
15 GAGGAAGAGG ( SEQ ID NO: 188); 
GGAAGAGGTG (SEQ ID NO : 190); 
AAGAGGTGGG ( SEQ ID NO: 192); 
GAGGTGGGTT ( SEQ ID NO: 194); 
GGTGGGTTTT (SEQ ID NO : 196); 
20 TGGGTTTTCC (SEQ ID NO: 198); 
GGTTTTCCAG ( SEQ ID NO: 200); 
TTTTCCAGTC (SEQ ID NO: 202) ; 
TTCCAGTCAC ( SEQ ID NO: 2 04); 
CCAGTCACAC(SEQ ID NO: 206) ; 
25 AGTCACACCT ( SEQ ID NO: 208); 
TCACACGTCA (SEQ ID NO: 210); 
ACACCTCAGG (SEQ ID NO: 212); 
ACCTCAGGTA (SEQ ID NO: 214); 
CTCAGGTACC (SEQ ID NO : 216); 
30 CAGGTACCTT (SEQ ID NO: 218); 
GGTACCTTTA (SEQ ID NO : 220); 
TACCTTTAAG ( SEQ ID NO : 222); 
CCTTTAAGAC ( SEQ ID NO: 224); 
TTTAAGACCA (SEQ- ID NO: 226); 
35 TAAGACCAAT (SEQ ID NO: 228); 
AGACCAATGA ( SEQ ID NO: 230); 
ACCAATGACT ( SEQ ID NO: 232); 
CAATGACTTA ( SEQ ID NO: 234); 
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GCTTGTGCCT (SEQ ID NO: 161); 
TTGTGCCTGG ( SEQ ID NO: 163); 
GTGCCTGGCT (SEQ ID NO: 165); 
GCCTGGCTAG (SEQ ID NO : 167); 
CTGGCTAGAA ( SEQ ID NO : 169); 
GGCTAGAAGC ( SEQ ID NO : 171); 
CTAGAAGCAC (SEQ ID NO: 173); 
AGAAGCACAA ( SEQ ID NO: 175); 
AAGCACAAGA (SEQ ID NO: 177); 
GCACAAGAGG (SEQ ID NO: 179); 
ACAAGAGGAG ( SEQ ID NO : 181); 
AAGAGGAGGA ( SEQ ID NO : 183); 
GAGGAGGAAG (SEQ ID NO : 185); 
GGAGGAAGAG ( SEQ ID NO: 187); 
AGGAAGAGGT ( SEQ ID NO: 18 9); 
GAAGAGGTGG ( SEQ ID NO: 191); 
AGAGGTGGGT ( SEQ ID NO : 193); 
AGGTGGGTTT (SEQ ID NO : 195); 
GTGGGTTTTC (SEQ ID NO: 197); 
GGGTTTTCCA (SEQ ID NO: 199); 
GTTTTCCAGT ( SEQ ID NO : 201); 
TTTCCAGTCA ( SEQ ID NO: 203); 
TCCAGTCACA ( SEQ ID NO: 205) ; 
CAGTCACACC(SEQ ID NO: 207); 
GTCACACCTC(SEQ ID NO: 209) ; 
CACACCTCAG(SEQ ID NO: 211) ; 
CACCTCAGGT (SEQ ID NO: 213); 
CCTCAGGTAC (SEQ ID NO : 215); 
TCAGGTACCT (SEQ ID NO: 217) ■ 
AGGTACCTTT (SEQ ID NO: 219); 
GTACCTTTAA (SEQ ID NO: 221); 
ACCTTTAAGA (SEQ ID NO: 223); 
CTTTAAGACC (SEQ ID NO: 225); 
TTAAGACCAA (SEQ ID NO: 227); 
AAGACCAATG (SEQ ID NO : 229); 
GACCAATGAC (SEQ ID NO: 231); 
CCAATGACTT (SEQ ID NO: 233); 
AATGACTTAC (SEQ ID NO : 23 5) ; 
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ATGACTTACA ( SEQ ID NO: 
GACTTACAAG ( SEQ ID NO: 
CTTACAAGGC ( SEQ ID NO: 
TACAAGGCAG ( SEQ ID NO: 
5 CAAGGCAGCT ( SEQ ID NO: 
AGGCAGCTGT (SEQ ID NO: 
GCAGCTGTAG ( SEQ ID NO: 
AGCTGTAGAT (SEQ ID NO: 
CTGTAGATCT (SEQ ID NO: 
10 GTAGATCTTA (SEQ ID NO: 
AGATCTTAGC (SEQ ID NO: 
ATCTTAGCCA ( SEQ ID NO: 
CTTAGCCACT (SEQ ID NO: 
TAGCCACTTT (SEQ ID NO: 
15 GCCACTTTTT (SEQ ID NO: 
CACTTTTTAA (SEQ ID NO: 
CTTTTTAAAA ( SEQ ID NO: 
TTTTAAAAGA (SEQ ID NO: 
TTAAAAGAAA ( SEQ ID NO: 
20 AAAAGAAAAG (SEQ ID NO: 
AAGAAAAGGG ( SEQ ID NO : 
GAAAAGGGGG (SEQ ID NO: 
AAAGGGGGGA (SEQ ID NO: 
AGGGGGGACT (SEQ ID NO: 
25 GGGGGACTGG ( SEQ ID NO: 
GGGACTGGAA ( SEQ ID NO: 
GACTGGAAGG ( SEQ ID NO: 
CTGGAAGGGC (SEQ ID NO: 
GGAAGGGGTA ( SEQ ID NO: 
30 AAGGGCTAAT ( SEQ ID NO : 
GGGCTAATTC ( SEQ ID NO 
GCTAATTCAC ( SEQ ID NO 
TAATTCACTC ( SEQ ID NO 
ATTCACTCCC ( SEQ ID NO 
35 TCACTCCCAA(SEQ ID NO 
ACTCCCAAAG ( SEQ ID NO 
TCCCAAAGAA ( SEQ ID NO 
CCAAAGAAGA ( SEQ ID NO 



236) ; 
238) ; 
240) ; 
242) ; 
244) ; 
246) ; 
248) ; 
250) ; 
252) ; 
254) ; 
256) ; 
258) ; 
26 0) ; 
262) ; 
264) ; 
266) ; 
268) ; 
270) ; 
272) ; 
274) ; 
276) ; 
278) ; 
280) ; 
282) ; 
284) ; 
286) ; 
: 288) ; 
: 290) ; 
: 292) ; 
: 294) ; 
: 296) ; 
: 298) ; 
: 300) ; 
: 3 02) ; 
: 304) ; 
: 306) ; 
308) ; 
310) ; 



TGACTTACAA ( SEQ ID NO : 237) ; 
ACTTACAAGG ( SEQ ID NO : 239); 
TTACAAGGCA ( SEQ ID NO : 241); 
ACAAGGCAGC (SEQ ID NO: 243); 
AAGGCAGCTG (SEQ ID NO: 245); 
GGCAGCTGTA ( SEQ ID NO: 247) ; 
CAGCTGTAGA (SEQ ID NO: 249); 
GCTGTAGATC (SEQ ID NO : 251); 
TGTAGATCTT (SEQ ID NO : 253); 
TAGATCTTAG (SEQ ID NO: 255); 
GATCTTAGCC (SEQ ID NO: 257) ; 
TCTTAGCCAC (SEQ ID NO: 259); 
TTAGCCACTT (SEQ ID NO: 261); 
AGCCACTTTT (SEQ ID NO: 263); 
CCACTTTTTA ( SEQ ID NO : 265); 
ACTTTTTAAA (SEQ ID NO : 267) ; 
TTTTTAAAAG (SEQ ID NO : 269) ; 
TTTAAAAGAA (SEQ ID NO: 271); 
TAAAAGAAAA ( SEQ ID NO: 273); 
AAAGAAAAGG ( SEQ ID NO: 275); 
AGAAAAGGGG ( SEQ ID NO : 277); 
AAAAGGGGGG (SEQ ID NO : 279); 
AAGGGGGGAC ( SEQ ID NO: 281); 
GGGGGGACTG ( SEQ ID NO : 283); 
GGGGACTGGA ( SEQ ID NO : 285); 
GGACTGGAAG ( SEQ ID NO : 2 87); 
ACTGGAAGGG ( SEQ ID NO: 289); 
TGGAAGGGCT ( SEQ ID NO: 2 91); 
GAAGGGCTAA (SEQ ID NO: 293); 
AGGGCTAATT ( SEQ ID NO: 295); 
GGCTAATTCA ( SEQ ID NO : 297); 
CTAATTCACT ( SEQ ID NO: 2 99); 
AATTCACTCC (SEQ ID NO: 3 01); 
TTCACTCCCA(SEQ ID NO : 3 03) ; 
CACTCCCAAA(SEQ ID NO : 3 05) ; 
CTCCCAAAGA ( SEQ ID NO : 307) ; 
CCCAAAGAAG ( SEQ ID NO : 3 09); 
CAAAGAAGAC ( SEQ ID NO: 311); 
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AAAGAAGACA ( SEQ ID NO: 
AGAAGACAAG (SEQ ID NO : 
AAGACAAGAT (SEQ ID NO: 
GACAAGATAT (SEQ ID NO: 
5 CAAGATATCC (SEQ ID NO: 
AGATATCCTT (SEQ ID NO: 
ATATCCTTGA (SEQ ID NO: 
ATCCTTGATC (SEQ ID NO: 
CCTTGATCTG (SEQ ID NO: 
10 TTGATCTGTG (SEQ ID NO: 
GATCTGTGGA (SEQ ID NO: 
TCTGTGGATC (SEQ ID NO: 
TGTGGATCTA (SEQ ID NO: 
TGGATCTACC (SEQ ID NO: 

15 GATCTACCAC (SEQ ID NO: 
TCTACCACAC (SEQ ID NO: 
TACCACACAC (SEQ ID NO: 
CCACACACAA(SEQ ID NO: 
ACACACAAGG ( SEQ ID NO: 

20 ACACAAGGCT (SEQ ID NO: 
ACAAGGCTAC (SEQ ID NO: 
AAGGCTACTT (SEQ ID NO: 
GGCTACTTCC (SEQ ID NO: 
CTACTTCCCT (SEQ ID NO: 

25 ACTTCCCTGA ( SEQ ID NO: 
TTCCCTGATT (SEQ ID NO: 
CCCTGATTGG (SEQ ID NO: 
CTGATTGGCA ( SEQ ID NO: 
GATTGGCAGA ( SEQ ID NO: 

30 TTGGCAGAAC (SEQ ID NO: 
GGCAGAACTA ( SEQ ID NO: 
CAGAACTACA ( SEQ ID NO: 
GAACTACACA ( SEQ ID NO: 
ACTACACACC(SEQ ID NO: 

35 TACACACCAG (SEQ ID NO: 
CACACCAGGG ( SEQ ID NO: 
CACCAGGGCC(SEQ ID NO: 
CCAGGGCCAG (SEQ ID NO: 
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AGGGCCAGGG (SEQ 
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ID 
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462) , 


• AGAGGCCAAT (SEQ 


ID 


NO: 
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GAGGCCAATA (SEQ 
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Generally, the subject HIV-1 isolate is non-pathogenic as hereinbefore defined. 
Additionally, reference herein to n a deletion" includes reference to a contiguous or non- 
contiguous series of two or more deletions. 

5 The non-pathogenic isolate may carry a single decanucleotide deletion or may carry 
more than one decanucleotide deletion. Where it carries multiple deletions these may 
all correspond to a contiguous sequence or be from different parts of the nef gene. 
Furthermore, the terminal end portions of a deletion may lie within a decanucleotide as 
defined above. It is emphasised that the present invention extends to analogous 
10 sequences from other pathogenic strains of HIV-1 which might carry nef genes with a 
slightly altered sequence relative to HIV-1 strain NL4-3. 

In a most preferred embodiment of the present invention, there is provided a non- 
pathogenic strain of HIV-1 carrying a nucleotide sequence in its genome as set forth 
15 in SEQ ID No. 614: 

GAAGAGATTTGGGAGAACATGACCTGGATGCAGTGGGAAAAAGAAATTCACAATCACAC 
AAAATACATATACTCCTTACTTGAAAAATCGCAGAAGCAACAAGAAAAGAATGAACAAG 
AACTATTGGAATTGGATCAATGGGCAAGTTTGTGGAATTGGTTTGACATAACAAAATGG 
CTGTGGTATATAAAAATATTCATAATGGTAGTAGGAGGCTTGATAGGTTTAAGAATAGT 

20 TTTTGCTGTACTTTCTATAGTGAATAGAGTTAGGCAGGGATACTCACCATTGTCGTTTC 
AGACCCTCCTCCCAACCCCGAGGGGACCCGACAGGCCCGAAGGAATCGAAGAAGAAGGT 
GGAGAGAGAGACAGAGACAGATCCACTCGATTAGTACACGGATTCTTAGCACTTTTCTG 
GGACGACCTGAGGAGCCTGTGCCTCTTCCTCTACCACCACTTGAGAGACTTACTCTTGA 
TTGTAACAAGGATTGTGGAACTTCTGGGACGCAGGGGATGGGAAGCCCTCAAATATTGG 

25 TGGAACCTCCTAAAGTATTGGAGCCAGGAACTGCAGAAGAGTGCTGTTATCTTGCTCAA 
TGCCACCGCCATAGCAGTAGCTGAGGGGACAGATAGAGTTTTAGAAGTATTACAAAGAG 
CTTATAGAGCTATCCTCCACATACCTAGAAGAATAAGACAGGGCCTCGAAATGGCTTTG 
CTATAAAATGGGTGGCAAGTGAGCAAAAAGTAGTGTAGTCAGATAGCATGCATCATAAG 
GGGTGGGGGCCAACAACTAACAATGCTGATCGTGCCTGGCTAGAAGCACAAGAGAAGGA 

30 AGAAGCGGGTTTTCCAGTCAAACCTCAGGTAGCTGTAGATCTTAGCCACTTTTTAAAAG 
AAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAAAGAAGACAAGATACACAGTGCTGC 
AAACTATTACCAGTGGAGTCAGCGAAGATAGAAGAGGCCAATGGAGGAGAAAACCACAG 
ATTGTTCTGTTGGGGACTTTCCATCCGTTGGGGACTTTCCAAGGCGGCGTGGCCTGGGT 
GACTAGTTCCGGTGGGGACTTTCCAAGAAGGCGCGGCCTGGGCGGGACTGGGGAGTGGC 

35 GAGCCCTCAGATGCTGCATATAAGCAGCTGCTTTCTGCTGTTACTGGGTCTCTCGGGTT 
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AGACCAGATCTGAGCCTGGGAGCTCTCTGGCTAACTAGGGAACCCACTGCTTAAGCCTC 
AATAAAGCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGT 

ATCTAGA; 



5 and/or SEQ ID NO: 615: 

GAAACAATTTGGGATAACATGACCTGGATGCAGTGGGAAAGAGAAATTGACAATTACAC 
AAACATAATATACACCTTAATTGAAGAATCGCAGAACCAACAAGAAAAAAATGAACTAG 
AATTATTGGAATTGGATAAATGGGCAAATTTGTGGAATTGGTTTAGTATATCAAACTGG 
CTATGGTATATAAAATTATTCATAATGGTAGTAGGAGGCTTGGTAGGTTTAAGAATAGT 

1 0 TTTTACTGTACTTTCTATAGTGAATAGAGTTAGGCAGGGATACTCACCATTGTCGTTTC 
AGACCCACCTCCCAACCCCGAAGGGACCCGACAGGCCAGAAGGAATCGAAGAAGAAGGT 
GGAGAGAGAGACAGAGGCAGCTCCACTCGATTAGTGCACGGATTCTTAGCACTTTTCTG 
GGACGACCTGAGGAGTCTGTGCCTCTTCAGCTACCACCACTTGAGAGACTTACTCTTGA 
TTGTAACGAGGATTGTGGAACTTCTGGGAGGCAGGGGATGGGAAGCCCTCAAATACTGG 

1 5 TGGAATCTCCTGCAGTATTGGAGGCAGGAACTACAGAAGAGTGCTGTTAGCTTGTTCAA 
TGGCACGGCCATAGCAGTAGCTGAGGGGACAGATAGAGTTATAGAAGCTTTACGAAGGG 
CTTATAGAGCTATTCTCCACATACCTAGAAGAATAAGACAGGGCTTAGAAAGGGCTTTG 
CTATAAAATGGGTGGCAAGTGGTCAGAAAGTAGTGTGGTTAGAAGGCATGTACCTTTAA 
GACAAGGCAGCTATAGATCTTAGCCGCTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCT 

20 AATTCACTCACAGAGAAGATCAGTTGAACCAGAAGAAGATAGAAGAGGCCATGAAGAAG 
AAAACAACAGATTGTTCCGTTTGTTCCGTTGGGGACTTTCCAGGAGACGTGGCCTGAGT 
GATAAGCCGCTGGGGACTTTCCGAAGAGGCGTGACGGGACTTTCCAAGGCGACGTGGCC 
TGGGCGGGACTGGGGAGTGGCGAGCCCTCAGATGCTGCATATAAGCAGCTGCTTTCTGC 
CTGTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGCTAACTAG 

25 GGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCC 
CGTCTGTTGTGTGACTCTGGTATCTAGA . 

The present invention, however, extends to HIV-1 isolates which are non-pathogenic; 
carry genomes capable of hybridising under low stringency conditions to SEQ ID NO: 
30 614 or SEQ ID NO: 615; and which do not direct the synthesis of a full length ne/gene 
product. 

In a further embodiment the present invention contemplates a viral isolate which: 

(i) is reactive to antibodies to a glycoprotein from HIV-1 such as at least one of 
35 gp41-45, gpl20 and/or gpl60; 

(ii) carries a genome or a part or fragment thereof capable of hybridising under 
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medium stringency conditions to a nucleotide sequence as set forth in SEQ ID 
NO: 1 or a complementary form thereof or an analogous sequence from another 
pathogenic strain of HIV-1; 

(iii) carries a deletion of at least ten nucleotides in a region corresponding to the LTR 
5 region in HIV-1 NL4-3; and 

wherein said deletion encompasses one or more of the following decanucleotides from 

the LTR region of HIV-1 NL4-3 or corresponding sequences from another pathogenic 

strain of HIV-1: 



10 


GCTTTTTGCC 


(SEO 


ID 


NO : 


652 ) • 


fTTTTT^PPT 




Tn 
xu 


jnu : 


/TCI \ 
DDJ | 




TTTTTGCCTG 

X X X X X X VJ 


( QFC) 
\ oLy 


TD 


"ran ♦ 


£ c; a \ . 


ill IbLLlbl 


(SEQ 


ID 


NO : 


655 ) 








XL) 


IMU : 


c c c \ . 
bbo) ; 


TTGCCTGTAC 


(SEQ 


ID 


NO : 


657) 








ID 


NO : 


r cr r» \ 

658) ; 


GCCTGTACTG 


(SEQ 


ID 


NO: 


659) 






(SEQ 


ID 


NO : 


6 60) ; 


CTGTACTGGG 


(SEQ 


ID 


NO: 


661) 


1 5 




( bh*Q 


ID 


NO : 


6 62) ; 


GTACTGGGTC 


(SEQ 


ID 


NO: 


663) 




x ^i*w x ± \ x 






MO - 


CCA \ . 


ACTGGGTCTC 


(SEQ 


ID 


NO : 


665) 




CTGGGTPTVT 

v — ■ X \J\J\J X V., ± \_ x 




TTi 

-L U 




bob/ ; 


rppp/-im/-imprp/-i 

1 CjCjCj 1 CI CTC 


(SEQ 


ID 


NO : 


667) 




GGGTCTCTPT 

»JVJU X X W X \w X 




xu 


Km • 


boo; } 


CjLj 1LJ.LILIG 


(SEQ 


ID 


NO : 


669) 




GTCTCTPTGH 

v_J X * — X V- X V— » X 




X u 


vrn . 

1^ * 


b / u / ; 




(SEQ 


ID 


NO : 


671) 


20 


^ X v_. J. \_ X ww x x 




XU 


V*<J Z 


o / z } ; 


TCTCTGGTTA 


(SEQ 


ID 


NO : 


673 ) 




LlLl CjGTTAG 


(SEQ 


ID 


NO : 


674) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


675) 




CTGGTTAGAC 


(SEQ 


ID 


NO: 


676) ; 


TGGTTAGACC 


(SEQ 


ID 


NO: 


677) 




GGTTAGACCA 


(SEQ 


ID 


NO: 


678) ; 


GTTAGACCAG 


(SEQ 


ID 


NO: 


679) 




TTAGACCAGA 


(SEQ 


ID 


NO: 


680) ; 


TAGACCAGAT 


(SEQ 


ID 


NO: 


681) 


25 


AGACCAGATC 


(SEQ 


ID 


NO: 


682) ; 


GACCAGATCT 


(SEQ 


ID 


NO: 


683) 




ACCAGATCTG 


(SEQ 


ID 


NO: 


684) ; 


CCAGATCTGA 


(SEQ 


ID 


NO: 


685) 




CAGATCTGAG 


(SEQ 


ID 


NO: 


686) ; 


AGATCTGAGC 


(SEQ 


ID 


NO: 


687) 




GATCTGAGCC 


(SEQ 


ID 


NO: 


688) ; 


ATCTGAGCCT 


(SEQ 


ID 


NO: 


689) 




TCTGAGCCTG 


(SEQ 


ID 


NO: 


690) ; 


CTGAGCCTGG 


(SEQ 


ID 


NO: 


691) 


30 


TGAGCCTGGG 


(SEQ 


ID 


NO: 


692) ; 


GAGCCTGGGA 


(SEQ 


ID 


NO: 


693) 




AGCCTGGGAG 


(SEQ 


ID 


NO: 


694) ; 


GCCTGGGAGC 


(SEQ 


ID 


NO: 


695) 




CCTGGGAGCT 


(SEQ 


ID 


NO: 


696) ; 


CTGGGAGCTC 


(SEQ 


ID 


NO: 


697) 




TGGGAGCTCT 


(SEQ 


ID 


NO: 


698) ; 


GGGAGCTCTC 


(SEQ 


ID 


NO: 


699) 




GGAGCTCTCT 


(SEQ 


ID 


NO: 


700) ; 


GAGCTCTCTG 


(SEQ 


ID 


NO: 


701) 


35 


AGCTCTCTGG 


(SEQ 


ID 


NO: 


702) ; 


GCTCTCTGGC 


(SEQ 


ID 


NO: 


703) 




CTCTCTGGCT 


(SEQ 


ID 


NO: 


704) ; 


TCTCTGGCTA 


(SEQ 


ID 


NO: 


705) 



WO 95/21912 



PCT/AU95/UUU0^ 



CTCTGGCTAA (SEQ ID NO: 706) ; 
CTGGCTAACT (SEQ ID NO: 708) ; 
GGCTAACTAG (SEQ ID NO: 710); 
CTAACTAGGG (SEQ ID NO: 712); 
5 AACTAGGGAA (SEQ ID NO: 714) j 
CTAGGGAACC (SEQ ID NO: 716), 
AGGGAACCCA (SEQ ID NO: 718); 
GGAACCCACT (SEQ ID NO: 72 0) l 
AACCCACTGC (SEQ ID NO: 722), 

10 CCCACTGCTT (SEQ ID NO: 724), 
CACTGCTTAA (SEQ ID NO: 726) , 
CTGCTTAAGC (SEQ ID NO : 72 8) 
GCTTAAGCCT (SEQ ID NO : 73 0) 
TTAAGCCTCA (SEQ ID NO: 73 2) 

15 AAGCCTCAAT (SEQ ID NO: 734) 
GCCTCAATAA (SEQ ID NO: 73 6) 
CTCAATAAAG (SEQ ID NO: 73 8) 
CAATAAAGCT (SEQ ID NO: 74 0) 
ATAAAGCTTG (SEQ ID NO: 742) 

20 AAAGCTTGCC (SEQ ID NO; 744) 
AGCTTGCCTT (SEQ ID NO : 746) 
CTTGCCTTGA (SEQ ID NO : 74 8) 
TGCCTTGAGT (SEQ ID NO : 750) 
CCTTGAGTGC (SEQ ID NO: 752) 

25 TTGAGTGCTT (SEQ ID NO : 754) 
GAGTGCTTCA (SEQ ID NO : 756) 
GTGCTTCAAG (SEQ ID NO: 758) 
GCTTCAAGTA (SEQ ID NO : 76 0) 
TTCAAGTAGT (SEQ ID NO: 762) 

30 CAAGTAGTGT (SEQ ID NO: 764) 
AGTAGTGTGT (SEQ ID NO : 766) 
TAGTGTGTGC (SEQ ID NO : 76 8) 
GTGTGTGCCC (SEQ ID NO : 770) 
GTGTGCCCGT (SEQ ID NO: 772) 

35 GTGCCCGTCT (SEQ ID NO: 774) 
GCCCGTCTGT (SEQ ID NO : 776) 
CCGTCTGTTG (SEQ ID NO : 778) 
GTCTGTTGTG (SEQ ID NO: 780) 



TCTGGCTAAC (SEQ ID NO : 707) ; 
TGGCTAACTA (SEQ ID NO: 709); 
GCTAACTAGG (SEQ ID NO: 711) ; 
TAACTAGGGA (SEQ ID NO: 713); 
ACTAGGGAAC (SEQ ID NO: 715); 
TAGGGAACCC (SEQ ID NO: 717); 
GGGAACCCAC (SEQ ID NO: 719); 
GAACCCACTG (SEQ ID NO: 721); 
ACCCACTGCT (SEQ ID NO: 723); 
CCACTGCTTA (SEQ ID NO: 725); 
ACTGCTTAAG (SEQ ID NO: 727); 
TGCTTAAGCC (SEQ ID NO : 729) ; 
CTTAAGCCTC (SEQ ID NO: 731); 
TAAGCCTCAA (SEQ ID NO: 733); 
AGCCTCAATA (SEQ ID NO: 73 5) ; 
CCTCAATAAA (SEQ ID NO: 737); 
TCAATAAAGC (SEQ ID NO: 739); 
AATAAAGCTT (SEQ ID NO: 741); 
TAAAGCTTGC (SEQ ID NO: 743); 
AAGCTTGCCT (SEQ ID NO: 74 5) ; 
GCTTGCCTTG (SEQ ID NO: 747); 
TTGCCTTGAG (SEQ ID NO: 749); 
GCCTTGAGTG (SEQ ID NO: 751) ; 
CTTGAGTGCT (SEQ ID NO : 753); 
TGAGTGCTTC (SEQ ID NO: 755) ; 
AGTGCTTCAA (SEQ ID NO : 757); 
TGCTTCAAGT (SEQ ID NO : 759) ; 
CTTCAAGTAG ( SEQ ID NO : 761); 
TCAAGTAGTG (SEQ ID NO : 763) ; 
AAGTAGTGTG (SEQ ID NO : 765); 
GTAGTGTGTG (SEQ ID NO : 767) ; 
AGTGTGTGCC (SEQ ID NO: 76 9) ; 
TGTGTGCCCG (SEQ ID NO : 771) ; 
TGTGCCCGTC (SEQ ID NO: 773) ; 
TGCCCGTCTG (SEQ ID NO: 775) ; 
CCCGTCTGTT (SEQ ID NO: 777) ; 
CGTCTGTTGT (SEQ ID NO: 119) ; 
TCTGTTGTGT (SEQ ID NO: 781) ; 
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CTGTTGTGTG (SEQ ID NO: 782); TGTTGTGTGA (SEQ ID NO: 783) 
GTTGTGTGAC (SEQ ID NO: 784); TTGTGTGACT (SEQ ID NO : 785) 
TGTGTGACTC (SEQ ID NO: 786); GTGTGACTCT (SEQ ID NO: 787) 
TGTGTGACTC (SEQ ID NO: 788); GTGTGACTCT (SEQ ID NO: 789) 
5 TGTGACTCTG (SEQ ID NO: 790); GTGACTCTGG (SEQ ID NO: 791) 
TGACTCTGGT (SEQ ID NO: 792); GACT CTGGTA (SEQ ID NO: 793) 
ACTCTGGTAA (SEQ ID NO: 794); CTCTGGTAAC (SEQ ID NO: 795) 
TCTGGTAACT (SEQ ID NO: 796); CTGGTAACTA (SEQ ID NO: 797) 
TGGTAACTAG (SEQ ID NO: 798); GGTAACTAGA (SEQ ID NO: 799). 

10 

The non-pathogenic isolate may carry a single decanucleotide deletion in the LTR region 
or may carry multiple deletions in the same region or in the LTR region and another 
region such as the nef gene. In particular, the mutation may be in the LTYUnef overlap 
region. Where it carries multiple deletions, these may correspond to a contiguous 
15 sequence or be from different parts of the LTR region and/or ne/gene. Furthermore, 
the terminal end portions of a deletion may lie within a decanucleotide as defined above. 

Yet another aspect of the present invention provides an infectious molecular clone 
comprising genetic sequences derived from the non-pathogenic HIV-I isolates as 
20 hereinbefore described and includes genetic sequences encoding major structural proteins 
such as gag, env and pol Infectious molecular clones are particularly useful as genetic 
compositions capable of "infecting" host cells without need of viral coat. The infectious 
molecular clones of the present invention may also be derived from pathogenic HIV-1 
strains rendered non-pathogenic as hereindescribed. 

25 

According to this latter embodiment, there is contemplated a method of attenuating a 
pathogenic strain of HIV-1, said method comprising inducing a mutation in the ne/gene 
and/or an LTR region to generate a non-pathogenic HIV-1 strain as hereinbefore 
described. Preferred mutations are deletions of at least ten nucleotides such as one or 
30 more of the decanucleotides as hereinbefore described. The mutations may also 
constitute substitutions and/or insertions of heterologous nucleic acid molecules in the 
nef and/or LTR regions such as the incorporation of a sense (i.e. co-suppression) 
antisense or ribozyme molecule (see below). 
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In still yet another embodiment of the present invention, there is provided an isolated, 
non-pathogenic strain of HIV-1 comprising a deletion in its genome of at least 10 
nucleotide within the region of nucleotide 8787 and 9709 and more particularly within 
the region 8800 and 9700 and even more preferably within the region 8800 and 9410, 
5 using the nucleotide numbering of HIV-1 NL4-3. 

In one embodiment, the deletion of at least 10 nucleotides is from within a region 
selected from the list consisting of (using the nucleotide numbers of HIV-1 NL4-3): 
nucleotide (i) 8830-8862; 
10 (ii) 9009-9035; 

(iii) 9019-9029; and 

(iv) 9033-9049. 

In another embodiment, the deletion of at least 10 nucleotides is from within a region 
15 selected from the list consisting of (using the nucleotide numbers of HIV-1 NL4-3): 
nucleotide (v) 9281-9371: 



20 



(vi) 9281-9362 

(vii) 9105-9224 

(viii) 9271-9370. 



and 



In yet another embodiment, the deletion of at least 10 nucleotides is from within a 
region selected from the list consisting of (using the nucleotide numbers of HIV-1 NL4- 

nucleotide (ix) 8882-8928; 

25 (x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 

In still yet another embodiment, the deletion of at least 10 nucleotides is from within a 
30 region selected from the list consisting of (using the nucleotide numbers of HIV-1 NL4- 

3): 
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nucleotide 



(xiii) 
(xiv) 



9105-9224; 



9389-9395; and 



(xv) 



9281-9366. 



The above embodiments including any combinations thereof or functional equivalents 
thereof. Most preferred variants of HIV- 1 are defined in Table 3. 

Reference herein to the non-pathogenic HIV-1 strains of the present invention includes 
10 reference to components, parts, fragments and derivatives thereof including both genetic 
and non-genetic material. Furthermore, the non-pathogenic HIV-1 strains may be in 
isolated form or resident in peripheral blood mononuclear cells (PBMCs) or other like 
cells where the genome of the HIV-1 strains is integrated as DNA from said HIV-1 
strains such as proviral DNA. In addition, the present invention extends to recombinant 
1 5 virus such as from (or resident in) prokaryotes or eukaryotes as well as in the form of 
infectious molecular clones. 

Accordingly, the present invention provides for the non-pathogenic HIV-1 isolate, 
genomic material therefrom, complementary proviral DNA, molecular infectious clones, 
20 recombinant viral particles or genetic sequences therefrom or cells expressing same or 
blood cells carrying proviral DNA or to any mutants, derivatives, components, 
fragments, parts, homologues or analogues of the foregoing. 

The non-pathogenic HIV-1 strains of the present invention are particularly useful in the 
25 development of therapeutic compositions, therapeutic molecules and/or diagnostic 
reagents. With regards to the former, the non-pathogenic HIV-1 strain may be 
considered as a live attenuated vaccine where individuals carrying DNA derived from 
said non-pathogenic HIV-1 strain such as proviral DNA in target cells are protected from 
infection by a corresponding pathogenic strain. The term "vaccine" is used in its broadest 
30 sense as a therapeutic composition or molecule which prevents or reduces HIV-1 
infection or risk of infection or which ameliorates the symptoms of infection. It may 
involve the stimulation of an immune response or may involve blocking HIV-1 cells 
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receptors and/or the use of genetic compositions, for example, to introduce ribozymes 
or antisense molecules to HIV-1 directed genetic sequences or to prepare infectious 
molecular clones. For convenience, all such compositions are referred hereinafter to 
"therapeutic compositions". 

5 

Accordingly, the present invention contemplates a method for inhibiting or reducing the 
risk of infection by a pathogenic strain of HIV-1, said method comprising administering 
to a subject a non-pathogenic HIV-1 as hereinbefore defined in an amount effective to 
infect target cells and to generate target cells carrying proviral DNA from said non- 
10 pathogenic HIV-1. More particularly, the present invention contemplates a method for 
inhibiting or reducing productive infection of an individual by a pathogenic strain of 
HIV-1, said method comprising administering to a subject a non-pathogenic isolate of 
HIV-1 in an amount effective to infect target cells and to generate target cells carrying 
proviral DNA from said non-pathogenic HIV-1. By "productive infection" as used in 
15 the specification and claims herein is meant the infection of a cell or cells by a 
pathogenic strain of HIV-1 which leads ultimately to the symptomology of AIDS or 
AIDS related diseases. A cell infected productively produces pathogenic virions. By 
definition, infection of an individual by a non-pathogenic strain of HIV-1 would not lead 
to productive infection. Non pathogenic HIV-1 strains generally replicate to a sufficient 
20 extent to protect cells against challenge by virulent or pathogenic strains. The methods 
of the present invention are applicable prophylactically (i.e. to prevent de novo infection) 
or therapeutically (i.e. to reduce or slow disease progression). 

The present invention further provides a method for vaccinating an individual against 
25 the development of AIDS or AIDS related diseases, said method comprising 
administering to said individual a non-pathogenic isolate of HIV-1 in an amount 
effective to infect target cells and to generate target cells carrying proviral DNA from 
said non-pathogenic HIV-L The term "vaccinating" should not be taken as limiting the 
invention to the prevention of HIV-1 infection by solely immunological means. The 
30 term "vaccinating" includes any means of preventing productive infection of an 
individual by pathogenic HIV-1. 
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As an alternative to the above methods, a therapeutic composition as hereinbefore 
defined is administered. The non-pathogenic isolate may be administered inter alia as 
an isolated viral preparation or via infected blood cells. Another aspect of the invention 
provides a therapeutic composition for inhibiting or reducing the risk of infection by a 
5 pathogenic strain of HIV-1 said therapeutic composition comprising a non-pathogenic 
strain of HIV-1 or genetic sequences derived therefrom as hereinbefore described and 
optionally one or more pharmaceutically acceptable carriers and/or diluents. The 
therapeutic composition of the present invention is generally suitable for intravenous, 
intraperitoneal, intramuscular, intramucosal (e.g. nasal spray, respiratory spray) or other 

10 forms of parenteral administration. The therapeutic composition might also be 
administered via an implant or rectally or orally. In addition to the mutations 
contemplated above, the non-pathogenic HIV-1 strain may also contain one or more 
other mutations to further reduce the risk of reversion to virulence and/or to insert a 
genetic sequence capable of providing directly or indirectly an identifiable signal, having 

15 further anti-HIV-1 properties and/or immunostimulatory or cell regulatory properties. 

For example, the non-pathogenic HIV-1 isolate in the therapeutic composition may 
comprise additional genetic material capable of directing expression of antisense 
nucleotide sequences to inhibit expression of one or more proteins encoded by a 
20 pathogenic strain of HIV-1. Alternatively, sense co-suppression may be employed. 
Preferred sense or antisense molecules would reduce expression of the ne/gene or affect 
normal functioning of the LTR region. 

According to this embodiment, the non-pathogenic HIV-1 strain may be considered as 
25 a targeting agent to introduce genetic constructs capable of reducing expression of one 
or more HIV-1 proteins or polypeptides. In this embodiment there is provided a viral 
isolate which: 

(i) is genetically or immunologically related to a pathogenic strain of HIV-1; 

(ii) is substantially non-pathogenic in human subjects; 

30 (iii) comprises a first nucleotide sequence constituting its genome which is capable 
of hybridising under medium stringency conditions to SEQ ID NO: 1 or a 
complementary form thereof; and 
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(iv) comprises a second nucleotide sequence within said first nucleotide sequence and 
which second nucleotide sequence directs expression of a mRNA molecule 
capable of inhibiting, reducing or otherwise down-regulating translation of a 
protein or polypeptide encoded by a pathogenic strain of HIV- 1 or inhibit, reduce 
5 or otherwise down regulate operation of a non-protein encoding a region of a 

pathogenic strain of HIV-1 . 



Preferred proteins or polypeptides targeted for reduced expression are those encoded by 
one or more of the following: gag, pol, env f tat, rev, vpu, vpr t vz/and/or we/genes. The 

10 most preferred protein or polypeptide targeted for reduced expression is the product of 
the nef gene. Alternatively, the target nucleotide sequence does not encode a 
polypeptide or protein but is required for other functions such as integration or excision 
from the human genome or expression of genes on the viral genome. An example of 
such a nucleotide sequence is the LTR region. Accordingly, the present invention 

15 extends to disruption to the function of such regions. 

The above aspect relates to use of antisense technology. The present invention extends, 
however, to use of ribozymes and/or co-suppression to achieve the same results. In an 
alternative embodiment, or in addition to, the second (or optionally a third) nucleotide 
20 sequence encodes an antiviral agent (e.g. interferon) and/or an immune enhancing agent. 

The identification of deletions inter alia in the nef gene and/or LTR region in 
asymptomatic subjects provides a unique opportunity to study the in vivo effects of 
attenuated HIV-1 strains carrying one or more mutations in selected genetic regions. In 
particular, the present invention provides a means for designing therapeutic compositions 
directed to inhibiting expression of a nef gene and/or LTR region in a pathogenic HIV-1 
strain (such as contemplated above) as well as developing a therapeutic regimem aimed 
at inhibiting the activity of the nef gene product for the function of the LTR region. 
According to this latter embodiment, the present invention provides a therapeutic 
composition comprising a molecule capable of inhibiting the intracellular activity of the 
nef gene product and/or LTR region, said composition further comprising one or more 
pharmaceutically acceptable carriers and/or diluents. 



25. 



30 
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The molecule contemplated by the above aspect of the subject invention may be a 
protein, polypeptide, peptide, chemical compound, sugar moiety or derivative of the nef 
gene product. The molecule will need to be capable of entering an infected cell. Where 
the molecule is a protein, polypeptide or peptide, it may be encoded by the second 
5 nucleotide sequence on the targeting vector as contemplated above. Alternatively, the 
molecule may be a nucleic acid molecule capable of targeting the nef gene or LTR 
region. 

The deletion mutants of the present invention may result in a modified we/gene product 
10 either having no readily discernable activity or having activity different to the naturally 
occurring nef protein. In any event, if a mutant nef gene product is produced, it will 
generally have a lower molecular weight than the naturally occurring we/protein and will 
have a different overall amino acid sequence. This provides, therefore, for a means for 
diagnosing individuals with benign fflV-1 infection by, for example, assaying for a 
15 modified nef protein or screening for a modified nef gene sequence. Alternatively, 
benign HIV-1 infection may be detected by assaying for a modified LTR region such 
as an altered nucleotide sequence. 

According to one embodiment, there is provided a method for determining the 
20 pathogenicity of an HIV-1 strain after said HIV-1 strain infects cells of an individual, 
said method comprising contacting a biological sample from said individual with an 
effective amount of an antibody specific to a nef protein from a non-pathogenic strain 
of HIV-1 (as hereinbefore defined) for a time and under conditions sufficient to form 
an antibody-modified nef protein complex and then detecting said complex. The 
25 presence of said complex is indicative of a modified nef gene product and of the non- 
pathogenicity of the strain of HIV-1. The biological sample is a sample likely to contain 
the modified we/gene product such as tissue extract or cell extract of an infected cell. 
However, where the modified nef gene product is capable of permeation or transport out 
of the cell, suitable biological fluid would include serum, whole blood, lymph and 
30 mucosal secretion amongst other fluids. Many variations in the subject assay are 
possible and are contemplated herein. For example, an assay could be based on the 
inability for a nef specific antibody to bind to a modified nef protein. For the purposes 
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of the present invention the term "contacting" including "mixing". 

The presence of a modified nef molecule in biological fluid can be detected using a wide 
range of immunoassay techniques such as those described in US Patent Nos. 4,016,043, 
5 4,424,279 and 4,018,653. These include both single-site and two-site, or "sandwich", 
assays of the non-competitive types, as well as in the traditional competitive binding 
assays and include ELISA and RIA techniques. Sandwich assays are among the most 
useful and commonly used assays and are favoured for use in the present invention. A 
number of variations of the sandwich assay technique exist, and all are intended to be 

10 encompassed by the present invention. Briefly, and by way of example only, in a 
typical forward assay, a modified nef product-specific antibody is immobilised onto a 
solid substrate to form a first complex and the sample to be tested for modified nef 
product brought into contact with the bound molecule. After a suitable period of 
incubation, for a period of time sufficient to allow formation of an antibody-modified 

15 t nef product secondary complex, a second modified nef protein antibody, labelled with 
a reporter molecule capable of producing a detectable signal, is then added and 
incubated, allowing time sufficient for the formation of a tertiary complex of antibody- 
modified nef product-antibody. Any unreacted material is washed away, and the 
presence of the tertiary complex is determined by observation of a signal produced by 

20 the reporter molecule. The results may either be qualitative, by simple observation of 
the visible signal or may be quantitated by comparison with a control sample containing 
known amounts of hapten. Variations of the forward assay include a simultaneous assay, 
in which both sample and labelled antibody are added simultaneously to the bound 
antibody, or a reverse assay in which the labelled antibody and sample to be tested are 

25 first combined, incubated and then added simultaneously to the bound antibody. These 
techniques are well known to those skilled in the art, and the possibility of minor 
variations will be readily apparent. The antibodies used above may be monoclonal or 
polyclonal. 

30 The solid substrate is typically glass or a polymer, the most commonly used polymers 
being cellulose, polyacrylamide, nylon, polystyrene, polyvinyl chloride or polypropylene. 
The solid supports may be in the form of tubes, beads, discs or microplates, or any other 
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surface suitable for conducting an immunoassay. The binding processes are well-known 
in the art and generally consist of cross-linking covalently binding or physically 
adsorbing the molecule to the insoluble carrier. 

5 By "reporter molecule", as used in the present specification, is meant a molecule which, 
by its chemical nature, produces an analytically identifiable signal which allows the 
detection of antigen-bound antibody. Detection may be either qualitative or quantitative. 
The most commonly used reporter molecule in this type of assay re either enzymes, 
fluorophores or radionuclide containing molecules (i.e. radioisotopes). In the case of an 

10 enzyme immunoassay, an enzyme is conjugated to the second antibody, generally by 
means of glutaraldehyde or periodate. As will be readily recognised, however, a wide 
variety of different conjugation techniques exist which are readily available to one 
skilled in the art. Commonly used enzymes include horseradish peroxidase, glucose 
oxidase, P-galactosidase and alkaline phosphatase, amongst others. The substrates to be 

15 used with the specific enzymes are generally chosen for the production,upon hydrolysis 
by the corresponding enzyme, of a detectable colour change. It is also possible to 
employ fluorogenic substrates, which yield a fluorescent product. 

Alternatively, fluorescent compounds, such as fluorescein and rhodamine, may be 
20 chemically coupled to antibodies without altering their binding capacity. When activated 
by illumination with light of a particular wavelength, the fluorochrome-labelled antibody 
adsorbs the light energy, inducing a state of excitability in the molecule, followed by 
emission of the light at a characteristic colour visually detectable with a light 
microscope. As in the EIA, the fluorescent labelled antibody is allowed to bind to the 
25 first antibody-hapten complex. After washing off the unbound reagent, the remaining 
complex is then exposed to the light of the appropriate wavelength, the fluorescence 
observed indicates the presence of the hapten of interest. Immunofluorescence and EIA 
techniques are both very well established in the art and are particularly preferred for the 
present method. However, other reporter molecules, such as radioisotope, 
30 chemiluminescent or bioluminescent molecules, may also be employed. It will be 
readily apparent to the skilled technician how to vary the procedure to suit the required 
purpose. It will also be apparent that the foregoing can be used to label a modified nef 
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product and to use same directly in the detection of, for example, circulatory antibodies 
specific to said modified nef product. 



Alternatively, genetic assays may be conducted to screen for abberations in the nef gene 
5 and/or LTR region. Such a genetic assay may be by Southern or Northern blot analysis, 
PGR analysis or the like using oligonucleotides specific to a deleted region of a nef gem 
and/or LTR region. 

According to this embodiment there is provided a method for determining the 
10 pathogenicity of an HIV-1 strain after said HIV-1 strain infects cells of an individual, 
said method comprising determining directly or indirectly the presence of a deletion 
mutation in the genome of said HIV-1 wherein the presence of a such a mutation is 
indicative of the presence of a non-pathogenic strain of HIV-1 . The deletion mutation 
may result in the genome being unable to synthesise a polypeptide or protein from a 
15 pathogenic strain of HIV-1 or may direct the synthesis of a truncated form of said 
polypeptide or protein. The mutation may also lead to altered expression of a 
polypeptide detectable by, for example, decreased synthesis of a particular protein, such 
as the nef gene product. Alternatively, the deletion mutation affects the LTR region or 
a regulatory region of the HIV-1 genome. In either case, affected viruses may also be 
20 detected by, for example, observing low viral copy numbers such as low viral loads. 

Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10 
nucleotides within the region from nucleotide 8787 to nucleotide 9709 using the 
nucleotide numbering of HIV-1 strain NL4-3. 

25 

Preferably, said non-pathogenic HTV-1 carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 
nucleotide (i) 8830-8862; 

(ii) 9009-9035; 
30 (iii) 9019-9029; and 

(iv) 9033-9049. 
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Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 
nucleotide (v) 9281-9371; 

(vi) 9281-9362; 
5 (vii) 9105-9224; and 

(viii) 9271-9370. 

Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 
10 nucleotide (ix) 8882-8928; 



(x) 8850-9006 

(xi) 8792-9041 

(xii) 9112-9204. 



and 



15 Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 
nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

20 

The above nucleotide numbers are based on the nucleotide numbering in the NL4-3 
genome. 

Particularly preferred oligonucleotides are based on the deleted regions of the nefgene 
25 and/or LTR region such as but not limited to one or more oligonucleotides based on 
SEQ ID NO: 2 to SEQ ID NO: 613 and/or SEQ ID NO: 652 to SEQ ID NO: 799. 

The present invention further extends to kits for the diagnosis of infection by pathogenic 
strains of HIV-1 or for determining the pathogenicity of infecting virus. The kits would 
30 be in compartmental form each comprising one or more suitable reagents for conducting 
the assay. 
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The present invention is further described by the following non-limiting Figures and 
Examples. 

In the Figures: 

5 

Figure 1 is a representation showing the alignment of the nucleotide sequences from 
donor D36 peripheral blood mononuclear cell (PBMC) [D36P] and non-pathogenic HIV- 
1 from recipient C18 HIV StV [C18S], C18 HIV^ [C18M] and C98 HIV [C98H] and 
C54 PBMC [C54P] with the equivalent region of HIV-1 NL4-3. Sequences labelled 

10 PBMC are from patient PBMC, those labelled HIV are from virus isolated from patient 
PBMC and grown in culture. Numbering for HIV-1 NL4-3 is as per Myer et al (1992) 
where nucleotide 1 is the first nucleotide of the complete proviral DNA sequence. 
D36P, C18S, C18M, C98H and C54P are numbered from the start of the region 
sequenced. Identity with NL4-3 sequence is shown by (*). Deleted nucleotides are 

15 shown by (-). Spaces introduced to maximise alignment are shown by (,). Features in 
NL4-3 are marked by overlining the sequence, features in D36 and CI 8 are marked by 
underlining the sequence. 

Figure 2 shows the alignment of encoded amino acid sequences of (a) tat exon 3 and 
20 (b) rev exon 3 from HIV-1 NL4-3, D36 PBMC, C18 HIV StV and C98 HIV. In-phase 
termination codons (*) and NL4-3 encoded amino acid numbers are shown. 

Figure 3 is a representation showing the alignment of C-terminal envelope glycoprotein 
gp41 amino acid sequences encoded by D36 PMBC, CI 8 HIV stv , CI 8 HIV,^ and 
25 C98 HIV. Numbering is that of the amino acid sequence of the mature envelope 
glycoprotein of HIV NL4-3. Termination in shown by (#). 

Figure 4 is a representation showing alignment of amino acid sequences encoded by the 
«e/genes of HIV-1 NL4-3, D36 PBMC, C18 HTV StV , C18 HIV^ and C98 HIV. In 
30 phase termination codons are shown by (#). Identical amino acids are shown by (*). 
Residues underlined are those immediately before a deletion. 



BNSDOCID: <WO 9521912A1 J_> 



WO 95/21912 



PCT/AU95/00063 



10 



- 41 - 



Figure 5 shows a duplication of NF/CB and Spl sequences in D36 PBMC, CI 8 HIV StV , 
C18 UIVmbc 311(1 C98 HIV demonstrated by alignment of their low homology region 
sequences with the NFKB-Spl region of HIV-1 NL4-3. Nucleotide numbering 
according to Figure 1. Identity with NL4-3 sequence shown by (*) and NFKB and Spl 
sites in NL4-3 overlined. Position of neffLTR region sequence deletion shown by (a). 

Figure 6 is a graphical representation showing replication of C18 and C98 viral isolates 
and D36 PBMCs from asymptomatic patients in PHA-stimulated PBMCs. 




Figure 7 is a graphical representation showing replication of viral isolates from 
15 asymptomatic patients in non-PHA stimulated PBMCs. 




20 

228200 is an Australian isolate of HIV-1 described by Kiernan, R. et al (1990). Its 
characteristics include being T cell trophic, with fast kinetics, high producer of HIV-1 
and/or SI phenotype. 

25 Figure 8 is a graphical representation of cell surface receptor expression for syncytia- 
inducing (SI)/ non-syncytia-inducing (NSI)/asymptomatic patient isolates. 



■ 


TfR 




IL-2R 


■ 


CD4 



30 



W095/21912 



PCT/AU95/00063 



42 



5 



228200 is defined in the legend to Figure 7. 243925 is a viral isolate of HIV- 1 which 
is monocyte/macrophage trophic and exhibits NSI phenotype (Dr Karen Coats-Fryer, 
PhD thesis entitled "Viral determinants of HIV- 1 syncytium formation", the University 
of Melbourne, Parkville, Victoria, Australia. 



Figure 9 is a representation of the nucleotide sequence of CI 8 HIV-1 MBC (SEQ ID NO: 
800). 

1 0 Figure 10(a) - (g) are graphical representations showing clinical immunology of cohort; 
(a) CDS; (b)(i) CD4 (ii) CD4%; (c)(i) CDS; (ii) CD8%; (d) lymphocyte count; (e) 
CD4/CD8 ratio; (f) p-2-microglobulin; and (g) Kaplan-Meier estimates of time to 
disease progression (Cox & Oakes, 1989). 

15 Figure 11 is a schematic representation of the deletion mutants of the present invention. 
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A summary of the SEQ ID Nos; used in the subject specification is shown below: 



5 


SEQ ID NO: 


DESCRIPTION 




1 


Nucleotide sequence of HIV- 1 strain NL4-3 genome 




2-613 


Decanucleotides of nef gene from HIV-1 strain NL4-3 


10 


614 


Partial nucleotide sequence of D36 HIV-1 isolate 




615 


Partial nucleotide sequence of C18 HIV- 1 }ABC isolate 




616-625 


PCR primers shown in Table 1 




696-633 


Sequence primers shown in Table 2 




OD** 


Amino acid residues 15-27 of HIV-1 NL4-3 nef protein 




odd 


HIV-1 NL4-3 tat exons (Figure 2) 




OjO 


HIV-1 D36 tat exons (Figure 2) 




637 
UJ / 


HIV-1 CI 8 tat exons (Figure 2) 






HIV-1 NL4-3 rev exons (Figure 2) 




OD7 


HIV-1 D36 rev exons (Figure 2) 


20 


04l) 


HIV-1 CI 8 rev exons (Figure 2) 




04 1 


HIV-1 NL4-3 C-terminal of gp41 (Figure 3) 




t>4Z 


HIV-1 D36 C-terminal of gp41 (Figure 3) 




AA1 


HIV-1 C18 C-terminal of gp41 (Figure 3) 




AAA 


HIV-1 NL4-3 nef gene (Figure 4) 


Id 


6AS 


HIV-1 D36 nef gene (Figure 4) 




646 


HTV-1 CI 8 nef sene (Figure 4) 




647 


HIV-1 NL4-3 NFKB/SP1 sequence (Figure 5) 




648 


HIV-1 D36 NFKB/SP1 sequence (Figure 5) 




649 


HIV-1 CI 8 NFKB/SP1 sequence (Figure 5) 


30 


650 


Nucleotide sequence of nef gene from HIV-1 strain NL4-3 




651 


Nucleotide sequence of env and nef regions of NL4-3 




652-799 


Decanucleotides of LTR region from HIV-1 strain NL4-3 




800 


Nucleotide sequence of CI 8 HlV-l^gc 
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EXAMPLE 1 

Source Material 

For the purposes of the following examples, a non-pathogenic HIV-1 strain was isolated 
from a recipient of HIV-1 infected blood. The recipient is designated "CI 8". Other 
recipients are defined as "C54" and "C98". The donor is identified herein as "D36". 
The place of isolation may be indicated after the abbreviation of "HIV". For example, 
St Vincents Hospital, Sydney (HIV StV ) or Macfarlane Burnet Centre of Medical 
Research, Melbourne (HIV,^). 



Exemplary viral isolates referred to herein as "CI 8" and "C98" Were deposited at the 
PHLS Centre for Applied Microbiology and Research, European Collection of Animal 
Cell Cultures (ECACC), Division of Biologies, Porton Down, Salisbury, Wiltshire SP4 
OJG. CI 8 was deposited on 17 October, 1994 under Provisional Accession Number 
15 V94101706 and C98 was deposited on 31 October, 1994 under Provisional Accession 
Number V94 103 11 69. 

Figure 1 1 is a summary of the deletion mutants of the present invention. 

20 Viruses were isolated by the following procedures: 

1. Infected peripheral blood mononuclear cells (PBMCs) were co-cultured with 
HIV-1 seronegative donor PBMCs. A convenient source of seronegaitve donor 
PBMCs is a blood bank. The supernatants and cells are harvested every 7 days 
and fresh medium added with CD8 depleted PBMCs. CD8 depletion promotes 
the ability to isolate HIV-1. The culture and procedure is continued for up to 
approximately 5 weeks; 



2. The infected PBMCs are purified from whole blood and these cells are cultured 
alone for up to 5 weeks. PMBCs alone are used because the virus is more likely 
30 to be monocytotropic. Fresh medium is added weekly and supernatant fluid is 

harvested at this time; 
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3. Supernatant fluids are harvested every approximately 7 days, fresh medium and 
fresh HIV-1 seronegative CD8 depleted PBMCs are added at this time; 

4. HIV-1 seronegative PBMCs are pretreated with with M-CSF for approximately 
72 hours prior to the addition of infected PBMCs. M-CSF has been shown to 
enhance HIV-1 replication in monocytes (Gendelman et al, 1988); or 



The supernatant fluid is harvested from the cultures of step 4 every 
approximately 7 days, fresh medium added together with HIV-1 seronegative 
stimulated CD8 depleted PBMCs. The virus is isolated from the infected 
PBMCs. 

EXAMPLE 2 

DNA Preparation and PCR Amplification 
15 Non-pathogenic HIV-1 (e.g. strain CI 8) infected peripheral blood mononuclear cells 
(PBMC) were harvested 4 days after infection of phytohaemagglutinin (PHA) stimulated 
HV-1 negative donor PBMC cultured by the method of Neate et al (1987) and washed 
in phosphate buffered saline (PBS). PBMC from Donor D36 and Recipients CI 8, C54 
and C98 were prepared by Ficoll isopaque centrifugation of buffy coat cells and washed 
20 with PBS. 



5. 

10 



Approximately 10 7 cells were lysed in 1ml lysis solution (0.45% v/v NP40, 0.45% v/v 
Tween 20, lOmM Tris-HCl pH 8.3, 40mM KC1 2.5 mM MgCl 2 ) and digested with 
60ug/ml proteinase K (Boehringer Mannheim) at 55°C for 1 hour followed by 100°C 
25 for 10 minutes. Lysates were stored at -20°C. 

All polymerase chain reaction (PCR) primers (Table 1) and sequencing primers (Table 
2) were synthesised using an Applied Biosystems model 391 DNA synthesiser using 
phosphoramidite chemistry. 

30 

Strict physical separation was maintained for sample, PCR reagent mix and PCR reaction 
preparations as well as amplification and analysis. Final reaction mixes (50ul) contained 
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2 jal neat or diluted cell lysate, 0.2|iM each primer, 200mM dNTPs and 1.25 units Taq 
polymerase (Boehringer Mannheim) in PCR buffer, (1 OmM Tris-HCl pH8.3, 50mM KCI, 
lOOjag/ml gelatine) adjusted to the optimum MgCl 2 concentration for the primer pair 
(1.5-3.0mM). Aliquoted reagent mix was overlaid with 50^1 mineral oil prior to 
5 addition of DNA template lysate. After template denaturation at 94°C for 3 min 
amplification was achieved with 30 cycles of 94°C, 1 min; 55°C, 1 min; 72°C, 2 mins. 
A final elongation reaction was conducted at 72°C for 7 minutes. For double PCR 
amplification 2 \xl of first round product was added to the second reagent mix and 
amplified as before. 

10 

PCR amplified DNA was checked for quality, quantity and fragment size by agarose gel 
electrophoresis in Tris-Acetate-EDTA buffer (Sambrook et al, 1989) stained in ethidium 
bromide and viewed by UV transillumination. 

15 EXAMPLE 3 

DNA Sequence Analysis 
The DNA sequence of PCR amplified HIV-1 regions was determined by the 
dideoxynucleotide method (Sanger et al, 1997) using Sequenase T7 polymerase (United 
States Biochemicals). 

20 

PCR amplified DNA was purified by PCR Magic prep resin chromatography (Promega). 
Approximately 2 to 7 ^ig purified DNA plus lOng specific primer (Table 2) were 
denatured by boiling for 3 mins and snap frozen to -20°C. The initial labelling reaction 
was for 3 minutes at 22°C (room temperature) with 35 SdNTP (500Ci/mmol; Dupont) 

25 followed by dideoxynucleotide termination reactions at 37°C for 5 minutes. NP40, to 
0.45% v/v, was included in denaturation and reaction mixes (Bachman et al, 1990). 
Sequencing reaction products were denatured in formamide and resolved on a 6% w/v 
poly aery lamide gel containing 8M urea, fixed in 10% v/v acetic acid, 10% v/v methanol 
and dried. Following autoradiography on XK1 film (Kodak) the gel sequences were 

30 read assembled, translated to protein and aligned using the PC/GENE suite of programs 
(IntelliGenetics, USA). 
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TABLE 1 
PCR PRIMERS 



PRIMER 


SEQUENCE 1, 3 


POSITION 2 


CM 


TGG AAGGGCTAATTTGGT(6 1 6) 


1-18 


C1-2 


ATCTTCCCTAAAAAATT AGCCTGTC(6 1 7) 


2099-2075 


LTR-3' 


AGGCTC AGATCTGGTCTAAC(6 1 8) 


9559-9540 


oivOO 


A GC AGCAGGAA GO A C T ATGGf 6 1 9) 


7786-7805 


Cl-6 


TGCTAGAGATTTTCCACAC(620) 


9709-9691 


KS-2 


AGTG AAT AGAGTT AGGC AGG(62 1 ) 


8326-8345 


RT5'-v3 


GTAAGACAGTATGATCAGATA(622) 


2418-2438 


RT3'-v2 


TTGTAGGGAATTCCAAATTCC(623) 


4660-4640 


RT5'-v2 


CAGGATCCTACACCTGTCAACATAAT(624) 


2487-2506 


RT3'-vl 


GGGAATTCCTTATTCCTGCTTG(625) 


4655-4634 



15 



1. Sequence is presented from 5' to 3' of the primer. 

2. Position is according to the numbering of HIV NL4-3 in Myers et al (1992). 

3. SEQ ID NOs are given in parentheses. 



WO 95/21912 



PCT/AU95/00063 



- 48 - 



TABLE 2 
SEQUENCING PRIMERS 



PRIMER 


SEQUENCE 1, 3 


POSITION 2 


KS3 


CCAGAAGTTCCACAATCC(626) 


8570-8553 


KS4 


TTCTTCT AGGT ATGTGG AG(62 7) 


8753-8735 


KS5 


AGTGAATTAGCCCTTCCAG(628) 


9093-9075 


KS6 


TGCTAGAGATTTTCCACAC(629) 


9709-9691 


SP2 


TGCTCTGGAAAACTCAT(630) 


8006-8022 


SP3 


CTTTCTATAGTGAATAGAG(63 1) 


8318-8336 


SP4 


TATTGGAGTCAGGAACT(632) 


8618-8634 


SPR 


GGTCT AACC AGAGAG AC(63 3 ) 


9547-9531 



15 1. Sequence is presented from 5' to 3' of the primer. 

2. Position is according to the numbering of HIV NL4-3 in Myers et al (1992). 

3. SEQ ID NOs are given in parentheses. 



EXAMPLE 4 

20 Cells and Cell Culture 

Peripheral blood was obtained from HIV-1 sero-negative volunteers and mononuclear 
cells prepared by centrifugation on a Ficoll/Hypaque density gradient (Peper et al, 1968). 
PBMC were activated with phytohemagglutinin (PHA; lO^g/lO 6 cells) for 48 h at 37°C 
washed and then cultured in RPMI 1640 medium containing 10% v/v heat inactivated 

25 foetal calf serum, 15mM HEPES, 0.1% v/v sodium bicarbonate, 25jig/ml polybrene 
(Sigma), 10% v/v interleukin 2 (Boehringer Mannheim) and 1:1000 anti-interferon 
(Miles) (IL-2 medium). Non-PHA stimulated cells were prepared in a similar manner 
except they were cultured in medium lacking PHA and IL-2. 
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EXAMPLE 5 




Antibodies specific for HIV-1 Nef were raised against a peptide corresponding to the 
predicted amino acid residues 15-27 (AVRERMRRAEPAA SEQ ID NO: 634) of Nef 

5 encoded by the HIV-1 clone pNL4.3 (Kemp et al, 1988). The peptide was conjugated 
to keyhole limpet hemocyanin (KLH; Calbiochem, Behring Diagnostics, CA) via 
glutaraldehyde and this complex used to immunise sheep (0.5mg peptide 
conjugate/sheep). Antibodies to the peptide were purified by affinity chromatography. 
Reactivity of the antibodies with recombinant HIV-1 Nef 25 and 27 was demonstrated 

10 by immunoblotting. 



Seven days post-infection HIV-1 C18-infected PBMCs and mock - infected cells were 

15 washed in PBS then lysed (0.5% w/v NP-40, 0.5% w/v sodium deoxycholate, 50mM 
NaCl, 25mM Tris-HCl, lOmM EDTA, 0.01% w/v sodium azide and lOmM 
phenylmethylsulphonylfluoride). After nuclei were spun out lysates were 
electrophoresed in a 13% w/v SDS-polyacrylamide gel (SDS-PAGE) and subsequently 
transferred to Hybond-C nitrocellulose (Amersham, Buckinghamshire, England) for 1 

20 h at 100 V using a Bio-Rad protein transfer cell (Bio-Rad, Richmond, Ca). Membranes 
were pre-incubated with 1% w/v BSA/PBS for 2 h at room temperature and then reacted 
with affinity purified sheep anti-Nef (15 . 27) , diluted 1 : 100, overnight at room temperature. 
After three washes in 1% w/v BSA/PBS, the blots were incubated with donkey anti- 
sheep Ig conjugated to biotin (Amersham, diluted 1:500) for 1 h at room temperature. 

25 After extensive washing as described above the membranes were incubated with 
streptavidin-conjugated horse radish peroxidase (Amersham; diluted 1:500 for 1 h at 
room temperature. All dilutions were made with 1% w/v BSA in PBS. After further 
washing the membrane was developed with phenylenediamine substrate (Dako, 
Dapopatts, Denmark). The antibody preparation used in the immunoblotting experiments 

30 was free of detectable antibodies to the immunogenic carrier protein and coupling 



EXAMPLE 6 



Reactivity of anti-Nef (15.27) 



with HIV C18-infected Cells in Immunoblotting 



reagent. 
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EXAMPLE 7 

Analysis by Polymerase Chain Reaction Amplification 
A 5' fragment defined by primers Cl-1 and Cl-2 containing the 5' LTR and part of the 
gag gene was amplified. DNA from HIV-1 CI 8 infected PBMC gave an amplified 
5 fragment (amplimer) of about 1 .9 kb compared with 2. lkb for pHXB2 control template, 
implying a deletion of about 200bp from HIV CI 8. Further amplification of this 
fragment with primers defining the U3 region of the LTR (Cl-1 and LTR-3') gave 
amplimers of about 300 bp for HIV-1 CI 8 infected PBMC DNA compared with 340bp 
for C18 and D36 PBMC DNA and 484 bp for pHXB2 control. This implies the loss of 
approximately 140 to 180bp from the U3 region of these proviral DNAs. 



10 



To analyse the nef-gene-3' LTR region, the nested primer pairs SK68-C1-6 and KS-2- 
LTR-3' were used in a double PCR. Amplimers of approximately 830bp were obtained 
for HIV-1 CI 8 infected PBMC DNA as well as for PBMC DNA from Donor D36 and 
1 5 Recipients CI 8, C54 and C98 compared with approximately 1230bp for pHXB2 DNA. 
These results suggest that about 400bp of DNA have been deleted from the Donor and 
Recipient proviral DNAs. 

In comparison, amplification of the polymerase gene region by double PCR with the 
20 nested primer pairs RT5'-v3-RT3'-v2 and RT5'-v2-RT3'-vl gave a fragment 
(approximately 2. 1 kb) the same size as the molecular clone pHXB2 fragment for HIV-1 
C18 infected PBMC DNA, suggesting that deletions from this region were unlikely. 

EXAMPLE 8 

25 Nucleotide Sequence of the nef-3'LTR Region 

PCR amplification experiments indicated an approximately 200bp nucleotide deletion 
from both the ne/gene and LTR regions of Donor D36 PBMC and Recipient CI 8 HIV-1 
proviral DNA. To further analyse these regions, the DNA sequence was determined for 
the PCR amplified nef-3 -LTR region of D36 PBMC, CI 8 isolates HTV,^ and HTV StV 

30 as well as isolate C98 HIV infected PBMC proviral DNA. The 3' region was amplified 
with outer primers (SK68-C16) and inner primers (SK68-LTR 3' or KS2-C16) and 
sequenced directly using a number of internal sequencing primers based on the HIV-1 
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NL4-3 nucleotide sequence (Table 2). 

Alignment of the nucleotide sequences of the amplified 3' region of donor D36 PBMC 
and recipient CI 8 isolates HIV^ and HIV StV and C98 HIV (Fig 1) showed a number 

5 of nucleotide sequences changes, including deletions, relative to the nucleotide sequence 
of wild-type infectious HIV-1 (HIV-1 NL4-3). In the region of alignment, D36 PBMC 
lacked 291 nucleotides, CI 8 HIV StV differed in size by 388 nucleotides (comprising 
deletions of 397 nucleotides and an insertion of 9 nucleotides), CI 8 HIV^c differed 
by 456 nucleotides and C98 HIV lacked 158 nucleotides compared with HIV-1 NL4-3. 

10 The overall identity with HIV-1 NL4-3 nucleotide sequence of D36 PBMC, CI 8 
HIV StV , HIV^c and C98 HIV nucleotide sequences, including deletions, was 73% 
(1157/1596), 67% (1459/1592), 62% (982/1592) and 79% (1105/1399), respectively. 

The D36 PBMC sequence differed from HIV-1 NL4-3 in a number of features. A 
15 change in the wild type tat termination codon from TAG to TCG (Ser) extended the 
third tat exon (which starts at splice acceptor 10) by a further 15 amino acids to 
terminate at a conserved TAG (Fig 1). The resulting C-terminal peptide is rich in 
charged amino acids (8/15) (Fig 2a). The wild type rev termination codon has also 
changed (TAG to GAG, Glu) and the third rev exon is extended for 14 codons to 
20 terminate at a conserved TAG (Fig 2b). The encoded extra amino acids are mainly 
polar (1 1/14) and charged in nature (Fig 2b). The sequence also encodes the C-terminal 
237 amino acids of Env gp41 (Fig 3) terminating at the normal termination codon. The 
D36 PBMC Env amino acid sequence has 85% identity with the HIV NL4-3 sequence, 
increasing to 89% if similarities are included. 



25 



30 



There are significant differences from HIV-1 NL4-3 downstream of the env (gp41) gene. 
A change in the fifth nef codon, from TGG (Trp) to TGA (Fig 1), introduces an early 
termination in the D36 PBMC «e/gene. The encoded Nef protein is identical to the N- 
terminal 4 amino acids of NL4-3 Nef (Fig 4). Following the early termination there are 
deletions of 33, 47, 93 and 91 nucleotides and a region of low sequence homology, 
compared with HIV NL4-3, prior to the wild type we/termination codon site (HIV NL4- 
3 nts 9405-9407). As well as removing a significant part of the nef gene, these deletions 
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also bring into phase a further 6 termination codons. While the polypurine tract (plus 
strand primer binding site) and the first 38 nucleotides of the LTR U3 region are 
perfectly conserved, downstream U3 region sequences are disrupted by the 93 and 91 
nucleotide deletions and the low homology region. The resulting U3 region lacks 
5 recognition sequences for the transcription factors c-myb, USF and TCFlct as well as 
one of the suggested NF-AT sites (Gaynor et al, 1992). Downstream from the 91 
nucleotide deletion, a 59 nucleotide region of low homology contains two extra NFKB 
enhancer sites 19 nucleotides upstream of the usual site of a pair of NFKB sites, the 
upstream one of which is altered in its S'-half in D36. Sequences further downstream 
10 are highly conserved with respect to HIV-1 NL4-3, including the position and number 
of Spl basal promoter sites, TATA box, TAR and polyadenylation signal sequences. 

Similar to D36 PBMC, the C18 fflV StV and HIV^ sequences show the tat third exon 
to be extended by 15 codons. All but two codons (altered by point mutations) are 
15 identical to those of D36 PBMC (Fig 2a). The rev third exon of both C18 isolates is 
also extended (Fig 2b) but by only three codons, identical to the first three codons of 
the D36 PBMC rev extension. The same 237 amino acid coding region of Env gp41 is 
found in both the CI 8 HIV DNA sequences (Fig 3) and shows 85% identify, increasing 
to 88% if similarities are included, with the same region of the NL4-3 Env gp41 

20 

It is in the nef gene and LTR regions that the major differences from wild-type HIV-1 
arise, just as in D36 PBMC. The nef gene of CI 8 HIV StV encodes 24 amino acids with 
9 of the 10 N-terminal being identical to the NL4-3 Nef protein (Fig 4). Thereafter, 
deletions of 177 and 1 1 nucleotides cause a frameshift and termination at the 25th codon 
25 (Fig 1). Downstream deletions of 120, 82 and 7 nucleotides cause further loss of wild 
type nef gene sequence and bring into phase a further three termination codons. 

The nef gene of CI 8 HIV^ encodes only 7 amino acids with only the initiator 
methionine identical to the NL4-3 Nef protein. This loss of identity and early 
30 termination is brought about by a 250 nucleotide deletion after the fifth nucleotide of 
the nef gene. Downstream deletions of 120 and 86 nucleotides cause further loss of 
wild-type nef gene sequences. In both CI 8 isolates there is perfect conservation of the 
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polypurine tract and 29/3 1 nucleotides at the 5' end of the U3 region immediately before 
the 120 nucleotide deletion (Fig 1). This deletion together with the downstream 82 and 
7 nucleotide deletions in HIV StV and 86 nucleotide deletion in HIV^c and the low 
homology region cause the loss of the 5' half of the NRT-1 site (Yamamoto et al 1992) 
5 and the downstream NFAT site. A third NFKB site is present 31 (HIV stV ) and 33 
(HIV^^ nucleotides upstream of the expected pair of NFKB sites which are 
themselves separated by 13 nucleotides instead of the 4 nucleotides in HIV-1 NL4-3. 
The 5'-most Spl site sequence is slightly altered but sequences downstream including 
the other 2 Spl sites, the TATA box, TAR and polyadenylation signal sequences are 
10 identical to HIV-1 NL4-3 sequence. 



The three sequences, D36 PBMC, CI 8 HIV StV and CI 8 HIVj^ show a number of 
similarities consistent with the transmission of virus from person D36 to person CI 8 as 
well as a number of differences indicating post-transmission divergence of sequence. 

15 All three have tat open reading frames (ORFs) extended by 15 codons. All three have 
extended rev ORFs. The new rev termination codon in both CI 8 HIV-1 isolates, three 
codons downstream of the NL4-3 rev termination codon, has a point mutation in D36 
PBMC to make a Glu codon so that it continues for a further 1 1 codons (Fig 2b) to 
terminate at a conserved TGA. The partial Env gp41 amino acid sequences are more 

20 closely related to each other (86% identity or 90% including similarities) than to HIV 
NL4-3 (85% and 89%, respectively). 

The nucleotide sequence of the nef and LTR region of the HIV-1 isolate from recipient 
C98 (C98 HIV) is 90.3% identical (1264/1399) to the HIV NL4-3 sequence, ignoring 

25 deletions. Similar to the D36 PBMC and CI 8 HIV StV and FHV^c isolates the C98 
HIV sequence shows the third exon of tat to be extended by 15 codons with all but one 
being identical to the D36 PBMC tat extension. Also, the rev gene is extended by 3 
codons, 2 of which are identical to the first 2 codons of the D36 PBMC rev extension. 
The sequence also encodes the C-terminal 223 amino acids of Env gp41 terminating at 

30 the normal termination codon. The C98 HIV Env amino acid sequence has 89% identity 
with HIV NL4-3 Env sequence, increasing to 92% of similarities are included. 
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As with the D36 PBMC and the C98 HIV isolate sequences it is the we/ gene and LTR 
regions that major differences from the HIV NL4-3 sequence arise. The «e/gene open 
reading frame of C98 HIV is much longer than in D36 PBMC, C18 HIV StV and 
fflV MBC> encoding 85 amino acids compared with 206 amino acids for NL4-3. Sixty 
5 eight of those 85 amino acids are identical to the N-terminal sequence of NL4-3 Nef. 
The single, small deletion (16 nucleotides) in the C98 HIV nef-alone regions (Table 3) 
occurs after ne/codon 82 causing a frameshift and termination after a further 3 codons 
at the start of the highly conserved polypurine tract sequence immediately before the 3'- 
LTR. The nef/LTR region has two deletions totalling 142 nucleotides. The 5'-most 
10 deletion of 42 nucleotides includes the splice acceptor 12 sequence. The NRT-1, dyad 
symmetry and myb response element sequences are all intact. However, the downstream 
100 nucleotide deletion includes sequences from the 3' end of the 5'-NF-AT and all of 
the 3' NF-AT sequences as well as the USF transcription factor recognition sequence. 
The downstream low homology region of 77 nucleotides lacks the TCF-loc sequence but 
15 has two additional NFKB sites 13 nucleotides apart and 26 nucleotides upstream of the 
3'-half-remmant of the normal 5' -NFKB site. Sequence downstream, including the 3'- 
NFKB site, the 3 Spl sites, TATA box TAR and polyadenylation signal sequences are 
all highly conserved. 

The main feature of the sequences is the series of deletions, with respect to HIV NL4-3, 
in the /«?/gene-3'-LTR region. These can be grouped into two regions namely the nef 
alone region, that part of the nef gene upstream of the LTR, and the neflLTR region, 
where the nef 'gene and LTR U3 regions overlap. The deletions in these regions of each 
of the sequences start and end at the same or similar positions (Table 3). The deletions 
are larger in C18 HIV StV and CI 8 HIV MBC sequences where totals* of 397 and 456 
nucleotides have been deleted (relative to NL4-3) compared to 291 nucleotides, from 
D36 and 158 nucleotides from C98 HIV. In the we/-alone region the two deletions in 
CI 8 HIV StV and the single deletion in CI 8 HIV^^ occupy the same region as the three 
deletions in D36 PBMC. Similarly, the nefTLTR region in the three deletions in the CI 8 
HIV StV , the two deletions in the CI 8 HIV MBC and the D36 PBMC sequences occupy 
the same region. These findings indicate that mutant virus was transmitted from D36 
to CI 8 after which further deletions and rearrangements occurred. Similarly, the 
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sequence of C98 HIV in the neflLTR region indicates two deletions occupying the same 
region as the nejJLTR deletions in D36 and the CI 8 sequences. However, the size (only 
16 nucleotides) and the position of the deletion in the «e/-alone region of C98 HIV are 
distinct from those of the D36 PBMC and CI 8 sequences. 

5 

The timing of transmission of virus by transfusion was that recipient CI 8 was transfused 
approximately 19 months after C98. Consistent with the relative timing of transmission 
and the sequence similarities and differences is the suggestion that at the time of 
transmission to C98, the D36 sequence had deletions in the ne/TLTR region but not in 

10 m?/-alone region. After transmission to C98, the C98 virus developed further deletions 
and rearrangements, including the deletion in the «e/-alone region. The D36 virus 
evolved so that at the time of transmission to CI 8, further deletions and rearrangements 
had occurred including deletion of sequence from the rar/-alone region distinct from the 
C98 HIV nef alone region deletion. After transmission to CI 8, further deletions and 

15 rearrangements occurred in the CI 8 virus giving rise to at least two sequences (HIV StV 

fflV MBc)' 



The nef-alone deletion region may be a mutation or recombination "hotspot" as it 
includes sequences that were found to be variably duplicated in 28 out of 54 Nef protein 

20 sequences derived from 8 of 12 patients analysed in a study (Shugars et al 1993). The 
sequence between the ne/alone and the neflLTR region deletions is highly conserved 
and is important in provirus integration into the infected cell genome and interacts with 
a number of cellular proteins. It is interesting that the sequence equivalent to NL4-3 
nucleotides 9209 to 9225 is retained in D36 and C98 HIV but lost in the CI 8 HIV 

25 sequences. This includes part of a sequence of dyad symmetry (9210 to 9231) and is 
a significant part of the binding site for NRT-1 (Yamamoto et al 1991) which has been 
shown to have a negative regulatory effect on HIV-1 expression. The presence of this 
sequence in D36 and C98 HIV and its absence from the CI 8 isolates may correlate with 
the inability to isolate virus from D36 PBMC and the poor replication of C98 HIV but 

30 the ability to isolate HIV-1 from CI 8 PBMC. The deletion of sequence equivalent to 
nucleotides 9281 to 9395 of NL4-3 causes the loss of some transcription factor binding 
sites including NFAT and USF from the D36, C18 HIV and C98 HIV sequences. 
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A further similarity between the D36, CI 8 HIV StV , CI 8 HIV MBC and C98 HIV 
sequences is a region of low homology to HIV-1 NL4-3 extending downstream of the 
ne/ILTR deleted region to the NFKB enhancer/Spl promoter site region. This low 
homology region in fact consists of incomplete duplications of part of the NFKB/Spl 
5 region (Fig 5) resulting in D36 and C98 HIV having 2 extra NFKB sites upstream of 
an altered 5' NFKB site while the CI 8 sequences have one extra NFKB site and altered 
spacing between the 5' and 3' wild type NFKB sites due to an insertion of 9 nucleotides. 

For the CI 8 and C98 HIV-1 isolates virus replication was assessed in PHA-stimulated 
10 and non-stimulated PBMCs (Figs 6 and 7). In PHA-stimulated PBMCs we also studied 
cell surface CD4 and IL-2R expression (Fig 8). In comparison with HIV-1 wild type 
SI and NSI isolates clearly both CI 8 HIVj^^ and C98 viruses are replication 
competent, though C98 HIV replicates more poorly than C18 HIV MBC and are of the 
NSI phenotype when syncytium formation and CD4 and IL-2R surface expression are 
15 taken into account. Additionally, and more surprisingly, these two viruses replicated 
almost as efficiently in non-PHA stimulated PBMCs when compared to a typical local 
wild type SI isolate (HIV-1 228200, Fig 7). 

When protein expression was assessed for CI 8 HIV MBC and C98 HIV^^ although 
20 structural proteins were identified, no typical Nef protein was seen in infected cells. 
However, analysis of cell ly sates prepared from PBMC infected with C18 HIV MBC or 
PBMC infected with C98 HIV MBC (which were subsequently stimulated by UV 
irradiation, see Valerie et al, 1988) by Western immunoblotting using two antibodies 
specific for the N-terminal region of Nef showed the presence of smaller proteins of 19 
25 kDa and 21 kDa, respectively. These proteins were not observed in mock-injected 
control PBMC lysates and were not observed when the infected-cell lysates were probed 
were probed with antibodies reactive with the C-terminus of Nef. 

Thus, although the CI 8 and C98 HIV isolates are replication competent in vitro they 
30 clearly replicate differently using different conditions for cell activation and from the 
known functions of HIV-1 Nef protein and the LTR show that the major deletion in the 
nef gene and/or the LTR is at least in part responsible for the outcome of infection, 
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implicating the importance of Nef and/or the LTR in the clinical outcome of infection 
in vivo. 

EXAMPLE 9 

5 Determination of Degree of Relatedness Between Viruses 

To determine the degree of relatedness between viruses such as between mutants or 
between mutants and a wild-type virus and to ascertain putative infected patients, the 
method of Delwart et al was employed. 

10 EXAMPLE 10 

Immune responsiveness of subjects infected by non-pathogenic HJV-l isolate 
In this example, the donor and recipients of the cohort were tissue typed and assessed 
for basic cellular immune responses. Proliferative responses and IL-2 production to the 
mitogens ConA and PHA, to allogeneic mononuclear cells (irradiated pooled 
15 mononuclear cells from 20 random donors) and to recall antigens (e.g. influenza and 
tetanus toxoid) were within normal ranges. While at the immunogenetic level, HLA 
typing failed to identify a consistently common allele or haplotype within the group. 

The conservation of CD4+ counts observed in the cohort, the relative integrity of their 
20 immune system and the varied HLA types of the donor and recipients further supports 
the fact that the symptomless condition of the cohort members is due to a non- 
pathogenic strain of HIV-1 or a strain of low virulence. 

Accordingly, this provides a screening procedure for subjects putatively infected by a 
25 non-pathogenic HIV-1 isolate where such subjects are seropositive for HIV-1 (e.g. have 
antibodies to an HIV-1 glycoprotein) yet have normal proliferative responses and 
cytokine production to mitogens, allogeneic mononuclear cells and to recall antigens. 
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EXAMPLE 11 

Clinical Immunology of Cohort 
To establish that the donor and the recipients belonging to the cohort exhibit normal 
5 immunological profiles, members of the cohort were assayed for CD3, CD4, CD8, 
lymphocyte count, CD4/CD8 ratio and p-2-microglobulin over time since 
seroconversion. 

Parameters considered normal in non-infected individuals range as follows: 

10 



620-2200 (xl0 6 /L) 
420-1410 (xl0 6 /L) 
200-980 (xl0 6 /L) 



Parameter 

CD3 55-82% 

CD4 29-58% 

CD8 12-43% 

15 Lymphocyte count 1000-3500 (xl0 6 /L) 

CD4/CD8 0.7-3.7 

p-2-microglobulin 0.00-2.20 mg/L 



20 The results are shown in Figures 10(a)-(g) and clearly show that the immunological 
profiles of cohort members are substantially normal further highlighting the non- 
pathogenicity of the fflV-1 isolates of the present invention. Figure 10(g) shows a graph 
of the Kaplan-Meier (Ox and Oates, 1989) estimates of time to disease progression 
(AIDS or CD4 > 250). The results demonstrate that the difference is large in spite of 

25 the conservative bias, with a median time to progression of 6.2 years in the main 
database. An exact logrank test (Cytel Software Corporation, 1989, StatXact: Statistical 
Software for Exact Nonparametric Inference. Cambridge, Massachusetts.) was 
performed, demonstrating that the difference between the groups was highly statistically 
significant (logrank statistic 11.8, pO.OOOl). 

30 
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EXAMPLE 12 

Sequencing of isolate HIV-1 C18 MBC 
The genome of variant HIV-1 designated CI 8 HIV-1 MBC was amplified by the 
5 polymerase chain reaction (PGR) as 7 overlapping fragments using the sets of inner and 
outer oligonucleotide primers, designed using the programme PCRPLAN 
(IntelliGenetics), listed in Table 5 and either UlTma (Applied Biosystems) or a mixture 
of KlenTaq and Pfti (KlenTaq LA, Ab Peptides Inc) polymerases (for faithful 
amplification of long fragments). The resulting fragments were cloned into the Smal 

10 site of the plasmid vector pGEM7Zf-h Insert-containing clones representing each region 
of the full length variant HIV-1 were sequenced by a nested deletion strategy (Gou & 
Wu, 1982) and cycle sequencing with Taq polymerase and dye labelled primers 
complimentary to the T7 or SP6 sites within the cloning vector. Nucleotide sequences 
were entered and collated by ASSEMGEL and SEQIN (IntelliGenetics) and SEQED 

15 (Applied Biosystems) and translated to the encoded amino acid sequences using 
TRANSL (IntelliGenetics) programmes. Sequence alignments used NALIGN, 
CLUSTAL (IntelliGenetics) and SEQED programmes. 

The full length sequence (Fig 9; SEQ ID NO:800) of isolate HIV-1 C18 MBG is 9207 
20 nucleotides long which is 506 nucleotides shorter than the HIV NL4-3 sequence. This 
size difference is comprised of 126 nucleotides of insertions and 632 nucleotides of 
deletions, see Table 6. The most extensive differences between the HIV-1 C18x^ 
sequence and HIV-1 NL4-3 are in the U3 region of the LTR and in the nef gene, as 
hereinafter described. 

25 

The 5' LTR has deletions of 120 and 87 nucleotides and a region of low sequence 
homology, which is the result of an imperfect duplication of the downstream NFkB and 
Spl response sequences. These result in the loss of sequence from a number sites 
important in the regulation of transcription of HIV-1 genes, including the negative 
30 response element (NRE) and the response elements for a number of transcription factors 
including NF-AT, NRT-1, USF and TCF-lot. Furthermore, the low homology region 
contains an extra NFkB and Spl sites as well as an insertion of 9 nucleotides between 
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the usual NFkB sites. Downstream of the NFkB sites the sequence of the LTR has a 
high level of homology (96.2%) with the same region of HIV NL4-3. 

The gag gene contains 3 insertions, which represent direct repetitions of adjacent 
5 sequences. The first is a perfect repeat of 15 nucleotides after the equivalent of 
nucleotide 1134 of HIV-1 NL4-3 and adds 5 amino acids to the C-terminus region of 
pl7 gag . The remaining 2 insertions are imperfect and perfect repeats of 30 and 6 
nucleotides, respectively, after the equivalent of HIV NL4-3 nucleotides 2163 and 2232, 
respectively. These encode an extra 12 amino acids in the C-terminus region of pl5 gag 
10 just downstream of the gag to pol frameshift sequences. The variation in sequence 
length of the gag gene at these two positions is unusual. The homology of the encoded 
amino acid sequence of HIV-1 018^ and HIV NL4-3 for the gag pi 7, p24, and pl5 
proteins is 87.1%, 93.5% and 94.3%, respectively. 

15 In the pol ORF, the encoded proteins have high homology with the NL4-3 sequences 
being 95.5% overall comprising plO protease 92.9%, p66 reverse transcriptase 95.4% 
and p34 integrase 95.8%. The amino acid sequence of the p61 RT lacks the mutations 
associated with resistance to the nucleoside (AZT, ddl, ddC)and non-nucleoside 
(Nevirapine) analogue drugs used in the treatment of HIV-1 -infected persons. 



The vi/gene encodes a 192 amino acid protein with 88.0% homology with that of HIV-1 
NL4-3. The vpr gene encodes a 96 amino acid protein with 89.6% homology with that 
of HIV-1 NL4-3. 

25 There are 2 insertions and 1 deletion of sequences in the vpu gene. The insertions of 
3 and 9 nucleotides are after the equivalent of nucleotide 6071 and 6234, respectively, 
of HIV-1 NL4-3. These add 1 amino acid after amino acid 3, and 3 amino acids after 
amino acid 59 of the encoded Vpu protein. The deletion of 12 nucleotides after the 
equivalent of HIV-1 NL4-3 nucleotide 6261 deletes 4 amino acids from the C-terminal 

30 region of Vpu as well as from the signal peptide of the env polyprotein, which is 
encoded by an overlapping reading frame. Amino acid sequence homology of HIV-1 
CISmbc Vpu with NL4-3 is 85.2%. 



20 
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The sequence encoding the env gpl20 has 9 insertions totalling 45 nucleotides (encoding 
15 amino acids) and the deletion of a total of 18 nucleotides (encoding 6 amino acids). 
These are listed in Table 6. All of these events (insertions and deletions) are at positions 
in the env gene. This is within the env V3 coding region, immediately upstream of the 
5 sequence encoding the so called V3 tip (or loop) amino acid sequence, Gly Pro Gly Arg. 
The V3 region sequence is that of a typical clade B subtype (North America, Europe and 
Australia) being identical to the clade B consensus sequence (based on 186 env 
sequences) at 29/35 positions. The type of amino acid at positions 1 1 and 28 of the V3 
loop region (where position 1 is the Cys at amino acid 266 of the env gpl20) is 

10 predictive of the viral non-syncytium / syncytium forming phenotype (Fouchier et al, 
1992). The HIV-1 C18 MBC env gene encoded amino acid sequence has Ser at position 
1 1 and He at position 28 of the V3 loop region. The lack of a positively charged amino 
acid at both positions is strongly indicative of a non-syncytium viral phenotype. The 
overall amino acid sequence homology with HIV-1 NL4-3 (ignoring deletions and 

15 insertions) is 86.1%, comprising 85.5% for the gpl20 region and 87.6% for the gp41 
region. 

Both the tat and rev second exon open reading frames (ORF) are longer than in HIV-1 
NL4-3. A change of the tat termination codon from TAG to TCG extends the tat ORF 
20 to a downstream in phase termination codon extending the encoded tat amino acid 
sequence by 15 residues, compared with the 86 amino acid long NL4-3 tat protein, to 
a total length of 101 amino acids. However, this is the usual length of the HIV-1 tat 
protein. 

25 Similarly, the normal rev termination codon is changed from TAG to GAG. This 
extends the rev ORF to an in-phase termination codon 3 codons downstream so that the 
encoded Rev protein is 1 19 amino acids long instead of the usual 1 16. 

As mentioned above the most extensive differences between the sequences of the isolate 
30 HIV-1 018^^ and HIV-1 NL4-3 are in the nef gene and the LTR region. While the 
nef gene overlaps the 3' LTR, these differences are found in both the nef alone and the 
nef I LTR overlap regions. The HIV ClS^^encoded nef protein is only 24 amino 
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acids long compared with the normal length of 206 amino acids. This severe shortening 
of the m>/ protein is due to the deletion of 188 nucleotides (the 177 and 11 nucleotide 



TAG, at the resulting 25th codon. Downstream there is further loss of potential we/gene 
5 sequences by the 120 and 87 nucleotide deletions situated in the nef I LTR overlap 
region. The resulting 24 amino acid nef protein is identical to the N-terminus of the 
HIV-1 NL4-3 nef at 9 of the first 10 positions. Thereafter, homology is lost completely. 

Some sequences used in the generation of mature mRNAs are altered or lost in CI 8^^. 

1 0 The dinucleotide immediately after the splice donor site 2 (SD2) at nts 48 1 8-48 1 9 (NL4- 
3 equivalent nts 4963-4964)is changed from the conserved GT to GC. It is expected 
that this change would lead to loss of function of this site as a splice donor. Splice 
donor 2 is used in the processing of HIV-1 transcripts to some of the mRNAs that 
encode Tat, Rev and nef proteins. Similarly the splice acceptor site 7 (SA7) sequence 

15 at nts 6477-6478 (NL4-3 equivalent nts 6602-6603) is changed from the conserved AG 
dinucleotide to TC. This change is expected to lead to loss of function of this site as 
a splice acceptor. While this SA site is used in HIV-1 mRNA processing it is not a 
major site and is not used in the production of the regulatory proteins (Tat, Rev or nef) 
mRNAs. The splice donor 12 site is absent from the 018^^ sequence (NL43 

20 equivalent nts 9161-9162) as it is within the first deletion region in the nef I LTR 
overlap region which occurs at nt 8797 and results in the loss of NL43 nucleotides 9105 
to 9224. It is significant that the SA12 site is absent from the sequence of all of the 
cohort virus isolates so far obtained as well as from the sequence of D36 PBMC, 
however, the C54 PBMC sequence does contain the SA12 site. SA12 is not used in the 

25 processing of mRNAs that encode the viral regulatory proteins. Normally SA12 is used 
in splicing in conjunction with SD1, 2, 3 and 4 and the resulting spliced RNA is 
probably not a mRNA but may have a regulatory role involving binding to cellular 
proteins (Smith et al, 1992). 



deletions) from the ne/alone region which also brings into phase a termination codon, 



30 An interesting feature of the sequence of the HIV-1 C\%y^ c isolate is the deletion and 
rearrangement of sequence from the 5' -LTR U3 region and the deletion of sequence 
from the nef gene (both nef alone and nef I V LTR regions). These being the only 
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features of the sequence distinct from disease-causing HIV-1. The lack of AIDS or 
AIDS-Iike symptoms in the patient CI 8 is attributed to the effects of the loss of LTR 
sequence and / or the loss of nef coding sequences and their role in the pathogenesis of 
AIDS. 

5 . 

TABLE 4 

Primers used to Amplify Overlapping regions of HIV-1 C18 MBC 

10 Primer 5'- Direction Primer Sequence 

Coordinate ( +/ - ) Length (nt) 



15 



25 



30 



35 



CL *A 1 + 3 0 TGGAAGGG CTAATTTACTCC CAAAAAAGAC 

CL 14 896 - 25 AATCGTTCTAGCTCCCTGCTTGCCC 

^ 18 1 + 3 0 AATCCCGGG TGGAAGGGCTAATTTACTCCC 

^ 13 796 - 31 CCTCTAGA CCGCTTAATACTGACGCTCTCGC! 

20 CL 11 682 + 23 TCTCTCGACGCAGGACTCGGCTT 

CL 18 3440 - 30 CTGTTTTCTGCCAGTTCTAGCTCTGCTTCT 



CL 12A 732 + 26 TTTCCCGGGCGG CGACTGGTG AGTAC 

^ 17 3330 - 32 CCCTCTA GACTTGCCCAATTCAATTTTCCCAC 

CL 26 3193 + 3 9 CCACACCAGACAAAAAGCATCAGAAAGAAC^^ 

CL €B 96? 1 " 3 9 TGCTAlSAGATTTTCCACAOGGACnuVAAro 

27 3251 + 39 CCATCCTTXlATAAATGGACAGTACAACCCArACT 

CL 28 €39 - 37 TGG C C CAAACATT ATG T AC CT CTG CAT CAT ATG C 

CL 19 5448 + 30 AGCAGGACATAACAAGGT AGGAT CTCTACA 

CL24 8422 - 28 GGATCTGTCTCTGTCTCTCTCTCCACCT 



Underlined sequences depict added restriction enzyme site 

+ and - orientations refer to sense and anti sense strands of the double stranded DNA 
sequence 
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TABLE 5 

Sequence Deletions and Insertions in HIV-1 C18 MBC . 
Compared with HIV-1 NL43 



10 



15 



20 



25 



30 



35 



Gene or Region 


Position 

C18 MBC 


(nt) 
NL43 


Deletions 
(nt) 


Insertions 
(nt) 


5' -LTR U3 


29 


29 


120 


- 


5' -LTR U3 


85 


205 


87 


- 


5' -LTR U3 


154 


360 


- 


9 


gag pl7 


939 


1134 


- 


15 


gag pl5 


1982 


2163 


- 


30 


gag pis 


2081 


2232 


- 


6 


vpu 


5927 


6062 


- 


3 


vpu/env 


6092 


6234 


- .■ 


9 


vpu/ env 


6128 


6261 


12 


- ' 


env 


6483 


6628 


- 


6 


env 


6514 


6653 


2 


- 


env 


6524 


6665 


1 


- 


env 


6630 


6772 


- 


9 


env 


6646 


6778 


- 


3 


env 


7011 


7141 


6 


- 


env 


7140 


7276 


3 


- 


env 


7195 


7334 




6 


env 


7266 


7399 


3 




env 


7278 


7414 




6 


env 


7290 


7420 




2 


env 


7300 


7429 




1 


env 


7314 


7441 


3 




env 


7463 


7593 




3 


env 


7471 


7598 




9 


nef 


8711 


8829 


177 




nef 


8723 


9018 


11 




nef / LTR 


8798 


9104 


120 




nef / LTR 


8854 


9280 


87 




LTR U3 


8923 


9435 




9 



40 



632 



126 
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EXAMPLE 13 

Macrophage Isolates ofHIV-1 C18 and HIV-1 C98 

5 HIV-1 has been isolated from the macrophages of patients CI 8 and C98. 

Patient monocytes were prepared as follows. Whole blood was spun at 2000rpm for 10 
minutes. Plasma was removed into a separate tube and the remaining cells were diluted 
1:2 in PBS' (magnesium and calcium free phosphate buffered saline). This was 
10 underlaid with 10 ml of Ficoll Isopaque and spun at 2000rpm for 20 minutes. Cells 
were collected from the interphase and washed three times with PBS'. These cells were 
then seeded into a 6 well Costar tray at a concentration of L0 x 10 7 /mi and allowed to 
adhere for 1 hour. Any non-adherent cells were removed by aspiration. 

15 Donor HIV-1 negative macrophages for use in co-cultivation were prepared as follows. 
Peripheral blood mononuclear cells were purified from whole blood using 
Ficoll/Isopaque density gradient. These cells were seeded at a concentration of 2.0 x 
10 6 /ml in teflon. PBMC were cultured in the presence of 3/zg/ml of PHA and lOOOU/ml 
of M-CSF 3 days prior to co-culture. 

20 

On day of co-culture, donor PBMC were CD8 depleted. Dyna beads coated with anti- 
CD8 were used for this purpose. Dyna beads were washed once in PBS' and then 
applied to a magnet for 3 minutes. Supernatant was removed and the beads were then 
resuspended in 250/xl of RF-10. Aliquots of 2.0 x 10 s patient cells were then added to 

25 250/xl (3 beads : 1 CD8 T-cell) of Dyna beads and allowed to incubate for 30 minutes 
on ice with occasional mixing. After 30 minutes the cell suspension was placed onto 
a magnet for 3 minutes. The supernatant was then removed placed into a second tube 
containing 142 ti\ (1 bead : 1 CD8 T-cell) of Dyna beads. This suspension was placed 
on ice for an additional 30 minutes with occasional mixing. After 30 minutes cell 

30 suspension was placed onto a magnet for 3 minutes. Supernatant was removed and 
washed once in RF-10. 
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For co-culture, CD8 depleted PBMC were then added to patient monocytes. Half media 
changes were done every 7 days for a period of 21 days. Aliquots of 2.5 ml of medium 
was removed from these cultures and replaced with CD 8 depleted donor PBMC in 
Iscoves containing 10% HuS (Human serum), 5% v/v FCS and 5% w/v IL-2 and 
5 lOOOU/ml of M-CSF. Harvested supematants were spun at 1400 rpm for 10 minutes 
and stored as 1ml aliquots. Cell pellets were lysed in 200/il of lysis buffer for PCR 
analysis. Infection was quantitated using a p24 EIA Kit. 

Cells were harvested from the co-cultures and used to prepare DNA as described above. 
10 The nefl 3'-LTR region of both virus isolates was amplified by PCR using the above 
described primer sets and conditions (Example 12). The resulting amplimers were 
cloned into the plasmid vector pT7T3U19 and the nucleotide sequence determined by 
the Taq cycle sequencing method with dye-labelled primers. 

15 The CI 8 macrophage sequence has 3 deletions starting and finishing at positions within 
3 nucleotides of the same deletions in CI 8,^. The encoded nef protein is 3 amino 
acids long compared with 7 amino acids for CI 8^. The low homology region of the 
LTR U 3 region of CI 8 macrophage is very similar in sequence to CI 8,^ and 
similarly it has one extra upstream NFkB site. 

20 

On the other hand, the sequence of C98 macrophage has a number of differences from 
the C98 isolate. While it has exactly the same first deletion of 16 nucleotides just 
upstream of the polypurine tract (PPT), in the ne/-alone region, and exactly the same 
second deletion (position and size) it has an extra deletion of 18 nucleotides at NL4-3 
25 equivalent nucleotides 9206 to 9223. The final deletion is in approximately the same 
position as in the C98 isolate but is 5 nucleotides longer. The encoded we/protein is 34 
amino acids long compared with 86 amino acids for the C98 isolate. The low homology 
region is very similar to the C98 isolate, having the same 2 extra upstream NFkB sites 
and completely lacking the normal 5' -NFkB site. 

30 
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EXAMPLE 14 

Construction and Use of an Infectious Molecular Clone 

5 Molecular biological techniques can be used to construct a molecular clone of, for 
example, HIV-1 08,^^ Two schemes may be used. In the first scheme genomic 
DNA, extracted from either the CD4 positive PBMC of the patient C 1 8 or donor PBMC 
that have been infected with the isolate HIV-1 CI 8,^ , is used as the template for 
polymerase chain reaction amplification, using thermostable polymerase of high 

10 transcriptional fidelity (eg UlTma polymerase or KlenTaq/Pfu polymerase mixture), of 
long (6 to 7 kb) overlapping fragments representing the 5'- and 3'-parts of the HIV-1 
c18 mbc Pro viral genome of total length 9207nts. The amplified fragments may then 
be ligated together after digestion with a restriction enzyme that cleaves at a unique site 
common to the overlapping region of the amplified fragments, for example the unique 

15 Bgl I or Nco I sites. Ligation of this full length proviral DNA into a plasmid vector 
will allow its propagation in E coli and the subsequent preparation of large (mg) 
quantities of this molecularly cloned proviral DNA. 

In the second scheme donor PBMC that have been infected with the isolate HIV-1 
20 CI 8,^ are used as a source of non-integrated proviral DNA which can be extracted 
from the infected cells by the Hirt extraction method (Hirt, 1967). Circular proviral 
DNA molecules may be linearised by digestion with a restriction enzyme that cleaves 
at a unique position in the genome (eg the Bgl I or Nco I sites). The resilting linearised 
molecules can be ligated into a plasmid or, more usually, a bacteriophage lambda (X) 
25 based vector (eg Charon 4a, XWES) after modification of the end to provide blunt or 
cohesive ends compatible with the vector. Transformation or transduction of £ coli with 
the recombinant plasmid or bacteriophage material, respectively allows the propagation 
of the proviral DNA. Clones of E coli containing proviral DNA may be selected and 
DNA prepared. Molecular clones of retroviral genomes prepared in this way are often 
30 permuted. Rearrangement to the functional arrangement of sequences is achieved by 
restriction enzyme cleavage and religation of fragments to reconstruct the correctly 
permuted proviral genome. 
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The molecularly cloned DNA products of both schemes can be used to prepare variant 
pro viral genomes that may be used as the basis of a biologically attenuated HIV-1 
vaccine strain. Similarly, they may be modified to contain extra DNA sequences in the 
nef-alone deletion region that may deliver sequences that may be of therapeutic 
5 advantage (eg antisense or ribozyme sequences). 

Infectious virus particles of HIV-1 018^^ , or modified virus, can be produced by 
transfection of human cells (eg HeLa cells) which will produce, and release to the 
culture medium, virus particles of HIV-1 ClS^c , or modified virus. These virus 
10 particles can be used to infect a variety of CD4 positive cells for further propagation or 
experimentation. 

EXAMPLE 15 

In vivo Primate Model 

15 Following construction of infectious molecular clones of the mutant HIV-1 strains, 
studies are then undertaken in primates to establish attenuation, immunogenicity and 
vaccine prophylactic efficacy. All studies compare mutant clones of HIV-1 with 
isogenic wild-type (WT) virus. Initial studies are performed using the macaque (M. 
nemistrina) model of HIV-1 infection. Macaque-infectious WT HIV-1 and mutant 

20 clones are compared with respect to duration of viremia, anatomic sites of replication, 
and cellular and humoral immune responses. Where the mutant HIV-1 clones induce 
an immune response in the macques infected, challenge studies with WT virus are also 
performed. Studies are performed in a limited number of chimpanzees, generally in 
parallel with the macaque studies. Relevant mutations are engineered into WT HIV-1 

25 clones previously shown to produce chronic infection in chimps, and the course of chimp 
infection with mutant clones compared with historical controls. If infection is 
established, WT challenge studies is also performed. 
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Those skilled in the art will appreciate that the invention described herein is susceptible 
to variations and modifications other than those specifically described. It is to be 
understood that the invention includes all such variations and modifications. The 
5 invention also includes all of the steps, features, compositions and compounds referred 
to or indicated in this specification, individually or collectively, and any and all 
combinations of any two or more of said steps or features. 
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SEQUENCE LISTING 
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(i)(a) APPLICANT: (Other than US) 
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(B) STREET: 1 LITTLE COLLINS STREET 
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(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 
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(B) COMPUTER: IBM PC compatible 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



TGGAAGGGCT 


AATTTGGTCC 


CAAAAAAGAC 


AAGAGATCCT 


TGATCTG TGG 




6 0 


CACAAGGCTA 


CTTCCCTGAT 




ACACACCAGG 




Af? AT ATrTAr* 


12 0 


TGACCTTTGG 


ATGGTGCTTC 


AAGTTAfTTAr 

nn\j x x n\j x 


rar;TTnAArr! 


AriAnr'AArrrA 


u/vriuLALju V, L-A 


18 0 


AATAAGGAGA 


GAAflAAf^Af": C 


Hull rt> — V— v_ 


r*T A TfZ 2 fTf C A 








AfWlAdAAfTT 




nJ\\3 1 I 1 oAl_A 




ft T*T w rr\ n "rr > ft /~< 




3 00 


AGCTGCATCC 


GGAGTACTAC 


ft ft ftf ft rTr rr 
AAAtsAL. 1 




TTTCTACAAG 


GGACTTTCCG 


360 


CTGGGGACTT 


TCCAGGGAGG 




OCvjVjtiACTvjtj 




GCCCTCAGAT 


420 


GCTACATATA 


AGCAGCTGCT 


TTITGCCTGT 


ACTGGGTCTC 


TCTGGTTAGA 


CCAGATCTGA 


480 


GCCTGGGAGC 


TCTCTGGCTA 


ACTAGGGAAC 


CCACTGCTTA 


AGCCrCAATA 


AAGCTTGCCT 


54 0 


TGAGTGCTCA 


AAGTAGTGTG 


TGCCCGTCTG 


TTGTGTGACT 


CTGGTAACTA 


GAGATCCCTC 


600 


AGACCCTTTT 


AGTCAGTGTG 


GAAAATCTCT 


AGCAGTGGCG 


CCCGAACAGG 


GACTTGAAAG 


660 


CGAAAGTAAA 


GCCAGAGGAG 


ATCTCTCGAC 


GCAGGACTCG 


GCTTGCTGAA 


GCGCGCACGG 


720 


CAAGAGGCGA 


GGGGCGGCGA 


CTGGTGAGTA 


CGCCAAAAAT 


TTTGACTAGC 


GGAGGCTAGA 


780 


AGGAGAGAGA 


TGGGTGCGAG 


AGCGTCGGTA 


TTAAGCGGGG 


GAGAATTAGA 


TAAATGGGAA 


840 


AAAATTCGGT 


TAAGGCCAGG 


GGGAAAGAAA 


CAATATAAAC 


TAAAACATAT 


AGTATGGGCA 


900 


AGCAGGGAGC 


TAGAACGATT 


ULjLIALi I IAA1 


CCTGGCCTTT 


TAGAGACATC 


AGAAGGCTGT 


960 


AGACAAATAC 


TGGGACAGCT 


AUAAl*,t_A 1 Ct, 


Li i L~ALjALAIj 


W\ 1 L_ALiAAlxA. 


ACTTAGATCA 


1020 


TTATATAATA 


CAATAGCAGT 






fin ft. *r r ft *tyt 


ft ii ft ft r* a ("ii /"v* 


1080 


AAGGAAGCCT 


TAGATAAGAT 






rrrftRnft lift 


nrzr^ a a r^/*' ft ft 


1140 


GCAGCAGCTG ACACAGGAAA 


CAACAGCCAG 


GTCAGCCAAA 


ATTACCrTAT 


A/TTY^r 1 AH A Af" 


1200 


CTCCAGGGGC 


AAATGGTACA 


TCAGGCCATA 


TCACCTAGAA 


CTTTAAATGC 


ATGGGTAAAA 


1260 


GTAGTAGAAG AGAAGGCTTT 


CAGCCCAGAA 


GTAATACCCA 


TGTTTTCAGC 


ATTATCAGAA 


1320 


GGAGCCACCC 


CACAAGATTT 


AAATACCATG 


CTAAACACAG 


TGGGGGGACA 


TCAAGCAGCC 


1380 


ATGCAAATGT 


TAAAAGAGAC 


L_A. T L-AA TU AL> 


GAACX7TGCAG 


AATGGGATAG 


ATTGCATCCA 


1440 


GTGCATGCAG 


GGCCTATTGC 


ACCAGGCCAG 


ATGAGAGAAC 


CAAGGGGAAG 


TGACATAGCA 


1500 


GGAACTACTA 


GTACCCTTCA 


GGAACAAATA 


GGATGGATGA 


CACATAATCC 


ACCTATCCCA 


1560 


GTAGGAGAAA 


TCTATAAAAG 


ATGGATAATC 


CTGGGATTAA 


ATAAAATAGT 


AAGAATGTAT 


1620 


AGCCCTACCA 


GCATTCTGGA 


CATAAGACAA 


GGACCAAAGG 


AACCCTTTAG 


AGACTATGTA 


1680 


GACCGATTCT 


ATAAAACTCT 


AAGAGCCGAG 


CAAGCTTCAC 


AAGAGGTAAA 


AAATTGGATG 


1740 


ACAGAAACCT 


TGTTGGTCCA 


AAATGCGAAC 


CCAGATTGTA 


AGACTATTTT 


AAAAGCATTG 


1800 


GGACCAGGAG 


CGACACTAGA 


AGAAATGATG 


ACAGCATGTC 


AGGGAGTGGG 


GGGACCCGGC 


1860 


CATAAAGCAA 


GAGTTTTGGC 


TGAAGCAATG 


AGCCAAGTAA 


CAAATCCAGC 


TACCATAATG 


1920 


ATACAGAAAG 


GCAATTTTAG 


GAACCAAAGA 


AAGACTGTTA 


AGTGTTTCAA 


TTGTGGCAAA 


1980 


GAAGGGCACA 


TAGCCAAAAA 


TTGCAGGGCC 


CCTAGGAAAA 


AGGGCTGTTG 


GAAATGTGGA 


2040 


AAGGAAGGAC 


ACCAAATGAA 


AGATTGTACT 


GAGAGACAGG 


CTAA'ITITIT 


AGGGAAGATC 


2100 


TGGCCTTCCC ACAAGGGAAG 


GCCAGGGAAT 


TTTCTTCAGA 


GCAGACCAGA 


GCCAACAGCC 


2160 


CCACCAGAAG 


AGAGCTTCAG 


GTTTGGGGAA 


GAGACAACAA 


CTCCCTCTCA 


GAAGCAGGAG 


2220 


CCGATAGACA 


AGGAACTGTA 


TCCTTTAGCT 


TCCCTCAGAT 


CACTCTTTGG 


CAGCGACCCC 


2280 
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TCGTCACAAT 


AAAGATAGGG 


GGGGAATTAA 


AGGAAGCTCT 


ATTAGATACA 


GGAGCAGATG 


2340 


ATACAGTATT 


AGAAGAAATG 


AATTTGCCAG 


GAAGATGGAA 


ACCAAAAATG 


ATAGGGGGAA 


2400 


TTGGAGGTTT 


TATCAAAGTA 


GGACAGTATG 


ATCAGATACT 


CAT AGAAAT C 


TGCGGACATA 


2460 


AAGCTATAGG 


TACAGTATTA 


GTAGGACCTA 


CACCTGTCAA 


CATAATTGGA 


AGAAATCTGT 


2520 


TGACTCAGAT 


TGGCTGCACT 


TTAAATTTTC 


CCATTAGTCC 


TATTGAGACT 


GTACCAGTAA 


2580 


AATTAAAGCC 


AGGAATGGAT 


GGCCCAAAAG 


TTAAACAATG 


GCCATTGACA 


GAAGAAAAAA 


2640 


TAAAAGCATT 


AGTAGAAATT 


TGTACAGAAA 


TGGAAAAGGA 


AGGAAAAATT 


TCAAAAATTG 


2700 


GGCCTGAAAA 


TCCATACAAT 


ACTCCAGTAT 


TTGCCATAAA 


GAAAAAAGAC 


AGTACTAAAT 


2760 


GGAGAAAATT 


AGTAGATTTC 


AGAGAACTTA 


ATAAGAGAAC 


TCAAGATTTC 


TGGGAAGTTC 


2820 


AATTAGGAAT 


ACCACATCCT 


GCAGGGTTAA 


AACAGAAAAA 


ATCAGTAACA 


GTACTGGATG 


2880 


TGGGCGATGC 


ATATTTTTCA 


GTTCCCTTAG 


ATAAAGACTT 


CAGGAAGTAT 


ACTGCATTTA 


2940 


CCATACCTAG 


TATAAACAAT 


GAGACACCAG 


GGATTAGATA 


TCAGTACAAT 


GTGCTTCCAC 


3000 


AGGGATGGAA 


AGGATCACCA 


GCAATATTCC 


AGTGTAGCAT 


GACAAAAATC 


TTAGAGCCTT 


3060 


TTAGAAAACA 


AAATCCAGAC 


ATAGTCATCT 


ATCAATACAT 


GGATGATTTG 


TATGTAGGAT 


3120 


CTGACTTAGA 


AATAGGGCAG 


CATAGAACAA 


AAATAGAGGA ACTGAGACAA 


CATCTGTTGA 


3180 


GG TGGGGATT 


TACCACACCA 


GACAAAAAAC 


ATCAGAAAGA 


ACCTCCATTC 


CTTTGGATGG 


3240 


GTTATGAACT 


CCATCCTGAT 


AAATGGACAG 


TACAGCCTAT 


AGTGCTGCCA 


GAAAAGGACA 


3300 


GCTGGACTGT 


CAATGACATA 


CAGAAATTAG 


TGGGAAAATT 


GAATTGGGCA 


AGTCAGATTT 


3360 


ATGCAGGGAT 


TAAAGTAAGG 


CAATTATGTA 


AACTTCTTAG 


GGGAACCAAA 


GCACTAACAG 


3420 


AAGTAGTACC 


ACTAACAGAA 


GAAGCAGAGC 


TAGAACTGGC 


AGAAAACAGG 


GAGATTCTAA 


3480 


AAGAACCGGT 


ACATGGAGTG 


TATTATGACC 


CATCAAAAGA 


CTTAATAGCA 


GAAATACAGA 


3540 


AGCAGGGGCA 


AGGCCAATGG 


ACATATCAAA 


TTTATCAAGA 


GCCATTTAAA 


AATCTGAAAA 


3600 


CAGGAAAATA 


TGCAAGAATG 


AAGGGTGCCC 


ACACTAATGA 


TGTGAAACAA 


TTAACAGAGG 


3660 


CAGTACAAAA 


AATAGCCAGA 


GAAAGCATAG 


TAATATGGGG 


AAAGACTCCT 


AAATTTAAAT 


3720' 


TACCCATACA 


AAAGGAAACA 


TGGGAAGCAT 


GGTGGACAGA 


GTATTGGCAA 


GCCACCTGGA 


3780 


TTCCTGAGTG 


GGAGTTTGTC 


AATACCCCTC 


CCTTAGTGAA 


GTTATGGTAC 


CAGTTAGAGA 


3840 


AAGAACCCAT 


AATAGGAGCA 


GAAACTTTCT 


ATGTAGATGG 


GGCAGCCAAT 


AGGGAAACTA 


3900 


AATTAGGAAA 


AGCAGGATAT 


GTAACTGACA 


GAGGAAGACA 


AAAAGTTGTC 


CCCCTAACGG 


3960 


ACACAACAAA 


TCAGAAGACT 


GAGTTACAAG 


CAATTCATCT 


AGCTTTGCAG 


GATTCGGGAT 


4020 


TAGAAGTAAA 


CATAGTGACA 


GACTCACAAT 


ATGCATTGGG 


AATCATTCAA 


GCACAACCAG 


4080 


ATAAGAGTGA 


AT CAGAGTT A 


GTCAGTCAAA 


TAATAGAGCA 


GTTAATAAAA 


AAGGAAAAAG 


4140 


TCTACCTGGC 


ATGGGTACCA 


GCACACAAAG 


GAATTGGAGG 


AAATGAACAA 


GTAGATGGGT 


4200 


TGGTCAGTGC 


TGGAATCAGG 


AAAGTACTAT 


TTTTAGATGG 


AATAGATAAG 


GCCCAAGAAG 


4260 


AACATGAGAA 


ATATCACAGT 


AATTGGAGAG 


CAATGGCTAG 


TGATTTTAAC 


CTACCACCTG 


4320 


TAGTAGCAAA 


AGAAATAGTA 


GCCAGCTGTG 


ATAAATGTCA 


GCTAAAAGGG 


GAAGCCATGC 


4380 


ATGGACAAGT 


AGACTGTAGC 


CCAGGAATAT 


GGCAGCTAGA 


TTGTACACAT 


TTAGAAGGAA 


4440 


AAGTTATCTT 


GGTAGCAGTT 


CATGTAGCCA 


GTGGATATAT 


AGAAGCAGAA 


GTAATTCCAG 


4500 


CAGAGACAGG 


GCAAGAAACA 


GCATACTTCC 


TCTTAAAATT 


AGCAGGAAGA 


TGGCCAGTAA 


4560 


AAACAGTACA 


TACAGACAAT 


GGCAGCAATT 


TCACCAGTAC 


TACAGTTAAG 


GCCGCCTGTT 


4620 


GGTGGGCGGG 


GATCAAGCAG 


GAATTTGGCA 


TTCCCTACAA 


TCCCCAAAGT 


CAAGGAGTAA 


4680 


TAGAATCTAT 


GAATAAAGAA 


TTAAAGAAAA 


TTATAGGACA 


GGTAAGAGAT 


CAGGCTGAAC 


4740 


ATCTTAAGAC 


AGCAGTACAA 


ATGGCAGTAT 


TCATCCACAA 


TTTTAAAAGA 


AAAGGGGGGA 


4800 


TTGGGGGGTA 


CAGTGCAGGG 


GAAAGAATAG 


TAGACATAAT 


AGCAACAGAC 


ATACAAACTA 


4860 


AAGAATTACA 


AAAACAAATT 


ACAAAAATTC 


AAAATTTTCG 


GGTTTATTAC 


AGGGACAGCA 


4920 


GAGATCCAGT 


TTGGAAAGGA 


CCAGCAAAGC 


TCCTCTGGAA AGGTGAAGGG 


GCAGTAGTAA 


4980 


TACAAGATAA 


TAGTGACATA 


AAAGTAGTGC 


CAAGAAGAAA 


AGCAAAGATC 


ATCAGGGATT 


S040 


ATGGAAAACA 


GATGGCAGGT 


GATGATTGTG 


TGGCAAGTAG 


ACAGGATGAG 


GATTAACACA 


5100 
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TGGAAAAGAT 


TAGTAAAACA 


CCATATGTAT 


ATTTCAAGGA 


AAGCTAAGGA 


CTGGTTTTAT 


5160 


AGACATCACT 


ATGAAAG TAC 


TAATCCAAAA 


ATAAGTTCAG 


AAGTACACAT 


CCCACTAGGG 


5220 


GATGCTAAAT 


TAGTAATAAC 


AACATATTGG 


GGTCTGCATA 


CAGGAGAAAG 


AGACTGGCAT 


5280 


TTGGGTCAGG 


GAGTCTCCAT 


AGAATGGAGG 


AAAAAGAGAT 


ATAGCACACA 


AGTAGACCCT 


5340 


GACCTAGCAG 


AC CAACT AAT 


TCATCTGCAC 


TATTTTGATT 


GTTTTTCAGA 


ATCTGCTATA 


5400 


AGAAATACCA 


TATTAGGACG 


TATAGTTAGT 


CCTAGGTGTG 


AATATCAAGC 


AGGACATAAC 


5460 


AAGGTAGGAT 


CTCTACAGTA 


CTTGGCACTA 


GCAGCATTAA 


TAAAACCAAA 


ACAGATAAAG 


5520 


CCACCTTTGC 


CTAGTGTTAG 


GAAACTGACA 


GAGGACAGAT 


GOAACAAGCC 


CCAGAAGACC 


5580 


AAGGGCCACA 


GAGGGAG CCA 


TACAATGAAT 


GGACACTAGA 


G CTTTTAG AG 


GAACTTAAGA 


5640 


GTGAAGCTGT 


TAGACATTTT 


CCTAGGATAT 


GGCTCCATAA 


CTTAGGACAA 


CATATCTATG 


5700 


AAACTTACGG 


GGATACTTGG 


GCAGGAGTGG 


AAGCCATAAT 


AAGAATTCTG 


CAACAACTGC 


5760 


TGTTTATCCA 


TTTCAGAATT 


GGGTGTCGAC 


ATAGCAGAAT 


AGGCGTTACT 


CGACAGAGGA 


5820 


GAG C AAGAAA 


TGGAGCCAGT 


AGATCCTAGA 


CTAGAGCCCT 


GGAAG CAT CC 


AGGAAGTCAG 


5880 


CCTAAAACTG 


CTTGTACCAA 


TTGCTATTGT 


AAAAAGTGTT 


GCTTTCATTG 


CCAAGTTTGT 


5940 


TTCATGACAA 


AAGCCTTAGG 


CATCTCCTAT 


GGCAGGAAGA 


AGCGGAGACA 


GCGACGAAGA 


6000 


GCTCATCAGA 


ACAGTCAGAC 


TCATCAAGCT 


TCTCTATCAA 


AGCAGTAAGT 


AGTACATGTA 


6060 


ATGCAACCTA 


T AAT ACT AG C 


AAT ACT AG CA 


TTAGTAGTAG 


CAATAATAAT 


AGCAATAGTT 


6120 


GTGTGGTCCA 


TAGTAATCAT 


AGAATATAGG 


AAAATATTAA 


GACAAAGAAA 


AATAGACAGG 


6180 


TTAATTGATA 


GACTAATAGA 


AAGAGCAGAA 


GACAGTGGCA 


ATGAGAGTGA 


AGGAGAAGTA 


6240 


TCAGCACTTG 


TGGAGATGGG 


GGTGGAAATG 


GGGCACCATG 


CTCCTTGGGA 


TATTGATGAT 


6300 


CTGTAGTGCT 


ACAGAAAAAT 


TGTGGGTCAC 


AGTCTATTAT 


GGGGTACCTG 


TGTGGAAGGA 


6360 


AGCAACCACC 


ACTCTATTTT 


GTGCATCAGA 


TGCTAAAGCA 


TATGATACAG 


AGGTACATAA 


6420 


TGTTTGGGCC 


ACACATGCCT 


GTGTACCCAC 


AGACCCCAAC 


CCACAAGAAG 


TAGTATTGGT 


6480 


AAATGTGACA 


GAAAATTTTA 


ACATGTGGAA 


AAATGACATG 


GTAGAACAGA 


TGCATGAGGA 


6540 


TATAATCAGT 


TTATGGGATC 


AAAGCCTAAA 


GCCATGTGTA 


AAATTAACCC 


CACTCTGTGT 


6600 


TAGTTTAAAG 


TGCACTGATT 


TGAAGAATGA 


TACTAATACC 


AATAGTAGTA 


GCGGGAGAAT 


6660 


GATAATGGAG 


AAAGGAGAGA 


TAAAAAACTG 


CTCTTTCAAT 


ATCAGCACAA 


GCATAAGAGA 


6720 


TAAGGTGCAG 


AAAGAATATG 


CATTCTTTTA 


TAAACTTGAT 


ATAGTACCAA 


TAGATAATAC 


6780 


CAGCTATAGG 


TTGATAAGTT 


GTAACACCT C 


AGTCATTACA 


CAGGCCTGTC 


CAAAGGTATC 


6640 


CTTTGAGCCA 


ATTCCCATAC 


ATTATTGTGC 


CCCGGCTGGT 


TTTGCGATTC 


TAAAATGTAA 


6900 


TAATAAGACG 


TTCAATGGAA 


CAGGACCATG 


TACAAATGTC 


AGCACAGTAC 


AATGTACACA 


6960 


TGGAATCAGG 


CCAGTAGTAT 


CAACT CAACT 


GCTGTTAAAT 


GGCAGTCTAG 


CAGAAGAAGA 


7020 


TGTAGTAATT 


AGATCTGCCA 


ATTTCACAGA 


CAATGCTAAA 


ACCATAATAG 


TACAGCTGAA 


7080 


CACATCTGTA 


GAAATTAATT 


GTACAAGACC 


CAACAACAAT 


ACAAGAAAAA 


GTATCCGTAT 


7140 


CCAGAGGGGA 


CCAGGGAGAG 


CATTTGTTAC 


AATAGGAAAA 


ATAGGAAATA 


TGAGACAAGC 


7200 


ACATTGTAAC 


ATTAGTAGAG 


CAAAATGGAA 


TGCCACTTTA 


AAACAGATAG 


CTAGCAAATT 


7260 


AAGAGAACAA 


TTTGGAAATA 


ATAAAACAAT 


AATCTTTAAG 


CAATCCTCAG 


GAGGGGACCC 


7320 


AGAAATTGTA 


ACGCACAGTT 


TTAATTGTGG 


AGGGGAATTT 


TTCTACTGTA 


ATTCAACACA 


7380 


ACTGTTTAAT 


AGTACTTGGT 


TTAATAGTAC 


TTGGAGTACT 


GAAGGGTCAA 


ATAACACTGA 


7440 


AGGAAGTGAC 


ACAATCACAC 


TCCCATGCAG 


AATAAAACAA 


TTTATAAACA 


TGTGGCAGGA 


7500 


AGTAGGAAAA 


GCAATGTATG 


CCCCTCCCAT 


CAGTGGACAA 


ATTAGATGTT 


CATCAAATAT 


7560 


TACTGGGCTG 


CTATTAACAA 


GAGATGGTGG 


TAATAACAAC 


AATGGGTCCG 


AGATCTTCAG 


7620 


ACCTGGAGGA 


GG CGATATGA 


GGGACAATTG 


GAGAAGTGAA 


TTATATAAAT 


ATAAAGTAGT 


7680 


AAAAATTGAA 


CCATTAGGAG 


TAGCACCCAC 


CAAGGCAAAG 


AGAAGAGTGG 


TGCAGAGAGA 


7740 


AAAAAGAGCA 


GTGGGAATAG 


GAGcrrixjrr 


CCTTGGGTTC 


TTGGGAGCAG 


CAGGAAGCAC 


7800 


TATGGGCTGC 


ACGTCAATGA 


CGCTGACGGT 


ACAGGCCAGA 


CAATTATTGT 


CTGATATAGT 


7860 


GCAGCAGCAG 


AACAATTTGC 


TGAGGGCTAT 


TGAGGCGCAA 


CAGCATCTGT 


TGCAACTCAC 


7920 
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AGTCTGGGGC 
TCAACAGCTC 
TTGGAATGCT 
GGAGTGGGAC 
GCAAAACCAG 
GTGGAATTGG 
AGGAGGCTTG 
GCAGGGATAT 
GCCCGAAGGA 
GAACGGATCC 
CCGCTTGAGA 
GTGGGAAGCC 
TAGTGCTGTT 
TATAGAAGTA 
GGGCTTGGAA 
GATGGCCTGC 
CAGTATCTCG 
ATGCTGCTTG 
CTCAGGTACC 
AAGAAAAGGG 
TGTGGATCTA 
GGGT CAGATA 
AGGTAGAAGA 
GAATGGATGA 
ATCACGTGGC 
ACAAGGGACT 
GGCGAGCCCT 
TTAGACCAGA 
CAATAAAGCT 
AACTAGAGAT 



ATCAAACAGC 
CTGGGGATTT 
AGTTGGAGTA 
AGAGAAATTA 
CAAGAAAAGA 
TTTAACATAA 
GTAGGTTTAA 
TCACCATTAT 
ATAGAAGAAG 
TTAGCACTTA 
GACTTACTCT 
CTCAAATATT 
AACTTGCTCA 
TTACAAGCAG 
AGGATTTTGC 
TGTAAGGGAA 
AGACCTAGAA 
TGCCTGGCTA 
TTTAAGACCA 
GGGACTGGAA 
CCACACACAA 
TCCACTGACC 
GGCCAATAAA 
CCCTGAGAGA 
CCGAGAGCTG 
TTCCGCTGGG 
CAGATGCTGC 
TCTGAGCCTG 
TGCCTTGAGT 
CCCTCAGACC 



TCCAGGCAAG 
GGGGTTGCTC 
ATAAATCTCT 
ACAATTACAC 
ATGAACAAGA 
CAAATTGGCT 
GAATAGTTTT 
CGTTTCAGAC 
AAGGTGGAGA 
TCTGGGACGA 
TGATTGTAAC 
GGTGGAATCT 
ATGCCACAGC 
CTTATAGAGC 
TATAAGATGG 
AGAATGAGAC 
AAACATGGAG 
GAAGCACAAG 
ATGACTTACA 
GGGCTAATTC 
GGCTACTTCC 
TTTGGATGGT 
GGAGAGAACA 
GAAGTGTTAG 
CATCCGGAGT 
GACTTTCCAG 
ATATAAGCAG 
GGAGCTCTCT 
GCTTCAAGTA 
CTTTTAGTCA 



AATCCTGGCT 
TGGAAAACTC 
GGAACAGATT 
AAGCTTAATA 
ATTATTGGAA 
GTGGTATATA 
TGCTGTACTT 
CCACCTCCCA 
GAGAGACAGA 
TCTGCGGAGC 
GAGGATTGTG 
C CTACAGTAT 
CATAGCAGTA 
TATTCGCCAC 
GTGGCAAGTG 
GAGCTGAGCC 
CAATCACAAG 
AGGAGGAAGA 
AGGCAGCTGT 
ACTCCCAAAG 
CTGATTGGCA 
GCTACAAGCT 
CCAGCTTGTT 
AGTGGAGGTT 
ACTTCAAGAA 
GGAGGCGTGG 
CTGCTTTTTG 
GGCTAACTAG 
GTGTGTGCCC 
GTGTGGAAAA 



GTGGAAAGAT 
ATTTGCACCA 
TGGAATAACA 
CACTCCTTAA 
TTAGATAAAT 
AAATTATTCA 
TCTATAGTGA 
ATCCCGAGGG 
GACAGATCCA 
CTGTGCCTCT 
GAACTTCTGG 
TGGAGTCAGG 
GCTGAGGGGA 
ATACCTAGAA 
GTCAAAAAGT 
AGCAGCAGAT 
TAGCAATACA 
GGTGGGTTTT 
AGATCTTAGC 
AAGACAAGAT 
GAACTACACA 
AGTACCAGTT 
ACACCCTGTG 
TGACAGCCGC 
CTGCTGACAT 
CCTGGGCGGG 
CCTGTACTGG 
GGAACCCACT 
GTCTGTTGTG 
TCTCTAGCA 



ACCTAAAGGA 
CTGCTGTGCC 
TGACCTGGAT 
TTGAAGAATC 
GGGCAAGTTT 
TAATGATAGT 
ATAGAGTTAG 
GAC CCGACAG 
TTCGATTAGT 
TCAGCTACCA 
GACGCAGGGG 
AACTAAAGAA 
CAGATAGGGT 
GAATAAGACA 
AGTGTGATTG 
GGGGTGGGAG 
GCAGCTAACA 
CCAGTCACAC 
CACTTTTTAA 
ATCCTTGATC 
CCAGGGCCAG 
GAG C CAGATA 
AGCCTGCATG 
CTAGCATTTC 
CGAGCTTGCT 
ACTGGGGAGT 
GTCTCTCTGG 
GCTTAAGCCT 
TGACTCTGGT 



7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8860 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9460 
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9600 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
ATGGGTGGCA 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
TGGGTGGCAA 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
GGGTGGCAAG 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

GGTGGCAAGT 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
GTGGCAAGTG 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single ■ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
TGGCAAGTGG 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
GGCAAGTGGT 

(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
GCAAGTGGTC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
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CAAGTGGTCA 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
AAGTGGTCAA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AGTGGTCAAA 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTGGTCAAAA 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS. single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TGGTCAAAAA 

(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
GGTCAAAAAG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GTCAAAAAGT 

(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
TC AAAAAGT A 

(2) INFORMATION FOR SEQ ID NO:I8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CAAAAAGTAG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AAAAAGTAGT 
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(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO;20: 
AAAAGTAGTG 

(2) INFORMATION FOR SEQ ID NO:2I: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
AAAGTAGTGT 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

AAGTAGTGTG 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

AGTAGTGTGA 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GTAGTGTGAT 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
TAGTGTGATT 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
AGTGTGATTG 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
GTGTGATTGG 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
TGTGATTGGA 

(2) INFORMATION FOR SEQ ID NO:29: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GTGATTGGAT 

(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
TGATTGGATG 

(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
GATTGGATGG 

(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
ATTGGATGGC 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
TTGGATGGCC 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
TGGATGGCCT 

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
GGATGGCCTG 

(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
GATGGCCTGC 

(2) INFORMATION FOR SEQ ED NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
ATGGCCTGCT 

(2) INFORMATION FOR SEQ ID NO:38: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
TGGCCTGCTG 

(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
GGCCTGCTGT 

(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
GCCTGCTGTA 

(2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
CCTGCTGTAA 

(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
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CTGCTGTAAG 

(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
TGCTGTAAGC 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GCTGTAAGGG 

(2) INFORMATION FOR SEQ ID NO:45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
CTGTAAGGGA 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
TGTAAGGGAA 

(2) INFORMATION FOR SEQ ID NO:47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
GTAAGGGAAA 



(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
TAAGGGAAAG 
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(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
GGAAAGAATG 



(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 

(2) INFORMATION FOR SEQ ED NO:49: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs GAAAGAATGA 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear (2) INFORMATION FOR SEQ ID NO:54: 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
AAGGGAAAGA 

(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 
AGGGAAAGAA 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 
AAAGAATGAG 



(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 
GGGAAAGAAT 



(ii) MOLECULE TVPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 
AAGAATGAGA 



(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 
AGAATGAGAC 

(2) INFORMATION FOR SEQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

GAATGAGACG 

(2) INFORMATION FOR SEQ ID NO:58: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 
AATGAGACGA 

(2) INFORMATION FOR SEQ ID NO:59: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 
ATGAGACGAG 

(2) INFORMATION FOR SEQ ID NO:60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 
TGAGACGAGC 

(2) INFORMATION FOR SEQ ID NO:61 : 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6l: 
GAGACGAGCT 

(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 
AGACGAGCTG 

(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 
GACGAGCTGA 

(2) INFORMATION FOR SEQ ID NO:64: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 
ACGAGCTGAG 

(2) INFORMATION FOR SEQ ID NO:65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 
CGAGCTGAGC 

(2) INFORMATION FOR SEQ ID NO:66: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 
GAGCTGAGCC 

(2) INFORMATION FOR SEQ ID NO:67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 
AGCTGAGCCA 

(2) INFORMATION FOR SEQ ID NO:68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
GCTGAGCCAG 

(2) INFORMATION FOR SEQ ID NO:69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

CTGAGCCAGC 

(2) INFORMATION FOR SEQ ID NO 70: 

(i) SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
TGAGCCAGCA 

(2) INFORMATION FOR SEQ ID NO:71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:71: 
GAGCCAGCAG 

(2) INFORMATION FOR SEQ ID NO:72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 
AGCCAGCAGC 

(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 
GCCAGCAGCA 

(2) INFORMATION FOR SEQ ID NO:74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
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CCAGCAGCAG 

(2) INFORMATION FOR SEQ ID NO:75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
CAGCAGCAGA 

(2) INFORMATION FOR SEQ ID NO:76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 
AGCAGCAGAT 

(2) INFORMATION FOR SEQ ID NO:77: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 
GCAGCAGATG 

(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

CAGCAGATGG 

(2) INFORMATION FOR SEQ ID NO:79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 
AGCAGATGGG 4 

(2) INFORMATION FOR SEQ ID NO:80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 
GCAGATGGGG 

(2) INFORMATION FOR SEQ ID NO:81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81: 
CAGATGGGGT 

(2) INFORMATION FOR SEQ ID NO:82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 
AGATGGGGTG 

(2) INFORMATION FOR SEQ ID N0.83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 
GATGGGGTGG 
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(2) INFORMATION FOR SEQ ID NO:84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 
ATGGGGTGGG 

(2) INFORMATION FOR SEQ ID NO:85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 
TGGGGTGGGA 

(2) INFORMATION FOR SEQ ID NO:86: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 
GGGGTGGGAG 

(2) INFORMATION FOR SEQ ID NO:87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87: 
GGGTGGGAGC 

(2) INFORMATION FOR SEQ ID NO:88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
GGTGGGAGCA 

(2) INFORMATION FOR SEQ ID NO:89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 
GTGGGAGCAG 

(2) INFORMATION FOR SEQ ID NO:90: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: 
TGGGAGCAGT 

(2) INFORMATION FOR SEQ ID NO:91 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 
GGGAGCAGTA 

(2) INFORMATION FOR SEQ ID NO:92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: 
GGAGCAGTAT 

(2) INFORMATION FOR SEQ ID NO:93: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 
GAGCAGTATC 

(2) INFORMATION FOR SEQ ID NO:94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 

AGCAGTATCT 

(2) INFORMATION FOR SEQ ID NO:95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 
GCAGTATCTC 

(2) INFORMATION FOR SEQ ID NO:96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: 
CAGTATCTCG 

(2) INFORMATION FOR SEQ ID NO:97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: 
AGTATCTCGA 

(2) INFORMATION FOR SEQ ID NO:98: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: 
GTATCTCGAG 

(2) INFORMATION FOR SEQ ID NO:99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99: 
TATCTCGAGA 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: 
ATCTCGAGAC 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TCTCGAGACC 

(2) INFORMATION FOR SEQ ED NO:102: 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I02: 
CTCGAGACCT 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
TCGAGACCTA 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 
CGAGACCTAG 



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:105: 
GAGACCTAGA 



(2) INFORMATION FOR SEQ ID NO:106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 



91 - 

AGACCTAGAA 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
GACCTAGAAA 



(2) INFORMATION FOR SEQ ID NO:108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid _ 

(C) STRANDEDNESS: single #^ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
ACCTAGAAAA 



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
CCTAGAAAAA 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
CTAGAAAAAC 



(2) INFORMATION FOR SEQ ID NO.l 1 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:Il I: 
TAGAAAAACA 

(2) INFORMATION FOR SEQ ID NO:l 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 12: 
AGAAAAACAT 

(2) INFORMATION FOR SEQ ID NO:l 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:113: 
GAAAAACATG 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 4: 
AAAAACATGG 

(2) INFORMATION FOR SEQ ID NO:l 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
AAAACATGGA 



(2) INFORMATION FOR SEQ ID NO:116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: II 6: 
AAACATGGAG 

(2) INFORMATION FOR SEQ ID NO: 11 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 17: 
AACATGGAGC 

(2) INFORMATION FOR SEQ ID NO:lI8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: II 8: 
ACATGGAGCA 

(2) INFORMATION FOR SEQ ID NO:l 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:119: 
CATGGAGCAA 

(2) INFORMATION FOR SEQ ID NO: 120; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
P) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
ATGGAGCAAT 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:121: 
TGGAGCAATC 

(2) INFORMATION FOR SEQ ID NO:122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
GGAGCAATCA 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123: 
GAGCAATCAC 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:124: 
AGCAATCACA 

(2) INFORMATION FOR SEQ ID NO:l25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 
GCAATCACAA 

(2) INFORMATION FOR SEQ ID NO: 126: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126. 
CAATCACAAG 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 
AATCACAAGT 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128: 

ATCACAAGTA 

(2) INFORMATION FOR SEQ ID NO:129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 
TCACAAGTAG 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 
CACAAGTAGC 

(2) INFORMATION FOR SEQ ID NO:I31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:131: 
ACAAGTAGCA 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
CAAGTAGCAA 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 
AAGTAGCAAT 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
AGTAGCAATA 

(2) INFORMATION FOR SEQ ID NO:135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:135: 
GTAGCAATAC 

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 
TAGCAATACA 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:137: 
AGCAATACAG 

(2) INFORMATION FOR SEQ ID NO: 1 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I38: 
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GCAATACAGC 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
CAATACAGCA 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:140: 
AATACAGCAG 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
ATACAGCAGC 

(2) INFORMATION FOR SEQ ID NO:142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:142: 

TACAGCAGCT 

(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:143: 
ACAGCAGCTA 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
CAGCAGCTAA 

(2) INFORMATION FOR SEQ ID NO.I45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
AGCAGCTAAC 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
GCAGCTAACA 

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 
CAGCTAACAA 
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(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
p) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

AGCTAACAAT 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I52: 
AACAATGCTG 



(2) INFORMATION FOR SEQ ID NO; 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 

(2) INFORMATION FOR SEQ ID NO.T49: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs ACAATGCTGC 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear (2) INFORMATION FOR SEQ ID NO; 154: 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION; SEQ ID NO:149: 
GCTAACAATG 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO: 150; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 
CTAACAATGC 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:154: 
CAATGCTGCT 



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I51: 
TAACAATGCT 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
AATGCTGCTT 

(2) INFORMATION FOR SEQ ID NO: 1 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
ATGCTGCTTG 

(2) INFORMATION FOR SEQ ID NO: 157: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 
TGCTGCTTGT 

(2) INFORMATION FOR SEQ ID NO: 158: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
"(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

GCTGCTTGTG 

(2) INFORMATION FOR SEQ ID NO: 1 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 
CTGCTTGTGC 

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
TGCTTGTGCC 

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:161: 
GCTTGTGCCT 

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 
CTTGTGCCTG 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
TTGTGCCTGG 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:164: 
TGTGCCTGGC 

(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:165: 
GTGCCTGGCT 

(2) INFORMATION FOR SEQ ID NO:166: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 
TGCCTGGCTA 

(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 
GCCTGGCTAG 

(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 
CCTGGCTAGA 

(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 69: 
CTGGCTAGAA 

(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY, linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 
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TGGCTAGAAG 

(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) * TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 
GGCTAGAAGC 

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 
GCTAGAAGCA 

(2) INFORMATION FOR SEQ ID NO: 1 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:173: 
CTAGAAGCAC 

(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
TAGAAGCACA 

(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:175: 
AGAAGCACAA 



(2) INFORMATION FOR SEQ ID NO: 1 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 
GAAGCACAAG 



(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 
AAGCACAAGA 



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO. 178: 
AGCACAAGAG 



(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 
GCACAAGAGG 
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(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 
CACAAGAGGA 



(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 
ACAAGAGGAG 



(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: ^gr 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:182: 
CAAGAGGAGG 



(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 
AAGAGGAGGA 



(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 84: 
AGAGGAGGAA 

(2) INFORMATION FOR SEQ ID NO: 1 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 
GAGGAGGAAG 

(2) INFORMATION FOR SEQ ID NO: 1 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 
AGGAGGAAGA 

(2) INFORMATION FOR SEQ ID NO:187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 
GGAGGAAGAG 

(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188 
GAGGAAGAGG 

(2) INFORMATION FOR SEQ ID NO: 189: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 
AGGAAGAGGT 

(2) INFORMATION FOR SEQ ID NO: 1 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 
GGAAGAGGTG 

(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 
GAAGAGGTGG 

(2) INFORMATION FOR SEQ ED NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 
AAGAGGTGGG 

(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ IDNO:193: 
AGAGGTGGGT 



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:194: 
GAGGTGGGTT 



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:l95: 
AGGTGGGTTT 



(2) INFORMATION FOR SEQ ID NO:196: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I96: 
GGTGGGT1TT 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:197 
GTGGGTTTTC 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 
TGGGTTTTCC 



(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 
GGGTTTTCCA 



(2) INFORMATION FOR SEQ ID NO:200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single ^ , 

(D) TOPOLOGY: linear J 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:200: ; 
GGTTTTCCAG r ." 



(2) INFORMATION FOR SEQ ID NO:201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:201: 
GTTTTCCAGT 



(2) INFORMATION FOR SEQ ID NO:202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS : 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ CD NO:202: 
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ttttccagtc 

(2) information for seq id no:203: 

(i) sequence characteristics: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:203: 
TTTCCAGTCA 

(2) INFORMATION FOR SEQ ID NO.204: 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:204: 
TTCCAGTCAC 

(2) INFORMATION FOR SEQ ID NO:205: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:205: 
TCCAGTCACA 

(2) INFORMATION FOR SEQ ID NO:206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 

CCAGTCACAC 

(2) INFORMATION FOR SEQ ID NO:207: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:207; 
CAGTCACACC 

(2) INFORMATION FOR SEQ ID NO:208: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208: 
AGTCACACCT 

(2) INFORMATION FOR SEQ ID NO:209: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:209: 
GTCACACCTC 

(2) INFORMATION FOR SEQ ID NO:210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2IO: 
TCACACCTCA 

(2) INFORMATION FOR SEQ ID NO:2 1 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2ll: 
CACACCTCAG 
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(2) INFORMATION FOR SEQ ID NO:212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:212: 
ACACCTCAGG 

(2) INFORMATION FOR SEQ ID NO:213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 
CACCTCAGGT 

(2) INFORMATION FOR SEQ ID NO:214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:214: 
ACCTCAGGTA 

(2) INFORMATION FOR SEQ ID NO:215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:215: 
CCTCAGGTAC 

(2) INFORMATION FOR SEQ ID NO.216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:216: 
CTCAGGTACC 

(2) INFORMATION FOR SEQ ID NO:2I7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:217: 
TCAGGTACCT 

(2) INFORMATION FOR SEQ ID NO:218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:218: 
CAGGTACCTT 

(2) INFORMATION FOR SEQ ID NO:219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:219: 
AGGTACCTTT 

(2) INFORMATION FOR SEQ ID NO220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:220: 
GGTACCTTTA 

(2) INFORMATION FOR SEQ ID NO:221: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:221: 
GTACCTTTAA 

(2) INFORMATION FOR SEQ ID NO:222: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:222: 
TACCTTTAAG 

(2) INFORMATION FOR SEQ ID NO:223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 
ACCTTTAAGA 

(2) INFORMATION FOR SEQ ID NO:224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:224: 
CCTTTAAGAC 

(2) INFORMATION FOR SEQ ID NO:225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



104 - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:225: 
CTTTAAGACC 

(2) INFORMATION FOR SEQ ID NO:226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:226: 
TTTAAGACCA 

(2) INFORMATION FOR SEQ ID NO:227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:227: 
TTAAGACCAA 

(2) INFORMATION FOR SEQ ID NO:228: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(it) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:228: 
TAAGACCAAT 

(2) INFORMATION FOR SEQ ID NO:229: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:229: 
AAGACCAATG 

(2) INFORMATION FOR SEQ ID NO:230: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:230: 
AGACCAATGA 

(2) INFORMATION FOR SEQ ID NO:231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:231: 
GACCAATGAC 

(2) INFORMATION FOR SEQ ID NO:232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION. SEQ ID NO:232: 
ACCAATGACT 

(2) INFORMATION FOR SEQ ID NO:233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233: 
CCAATGACTT 

(2) INFORMATION FOR SEQ ID NO:234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:234: 
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CAATGACTTA 

(2) INFORMATION FOR SEQ ID NO:235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:235: 
AATGACTTAC 

(2) INFORMATION FOR SEQ ID NO:236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:236: 
ATGACTTACA 

(2) INFORMATION FOR SEQ ID NO:237: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:237: 
TGACTTACAA 

(2) INFORMATION FOR SEQ ID NO:238: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:238: 
GACTTACAAG 

(2) INFORMATION FOR SEQ ID NO:239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:239: 
ACTTACAAGG 

(2) INFORMATION FOR SEQ ID NO:240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240: 
CTTACAAGGC 

(2) INFORMATION FOR SEQ ID NO:24I: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:241: 
TTACAAGGCA 

(2) INFORMATION FOR SEQ ID NO:242: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:242: 
TACAAGGCAG 

(2) INFORMATION FOR SEQ ID NO:243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS. single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:243: 

ACAAGGCAGC 



(2) INFORMATION FOR SEQ ID NO:244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xij SEQUENCE DESCRIPTION: SEQ ID NO:244: 
CAAGGCAGCT 

(2) INFORMATION FOR SEQ ID NO:245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:245: 
AAGGCAGCTG 

(2) INFORMATION FOR SEQ ID NO:246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:246: 
AGGCAGCTGT 

(2) INFORMATION FOR SEQ ID NO:247: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0.247: 
GGCAGCTGTA 

(2) INFORMATION FOR SEQ ID NO:248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO;248: 
GCAGCTGTAG 

(2) INFORMATION FOR SEQ ID NO:249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:249: 
CAGCTGTAGA 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:253: 
TGTAGATCTT 



(2) INFORMATION FOR SEQ ID NO:254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:250: 
AGCTGTAGAT 



(2) INFORMATION FOR SEQ ID NO:251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:251: 
GCTGTAGATC 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:254: 
GTAGATCTTA 

(2) INFORMATION FOR SEQ ID NO:255: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:255; 
TAGATCTTAG 



(2) INFORMATION FOR SEQ ID NOJ256: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID N0252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:252: 
CTGTAGATCT 



(2) INFORMATION FOR SEQ ID NO:253: 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:256: 
AGATCTTAGC 



(2) INFORMATION FOR SEQ ID NO:257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:257: 
GATCTTAGCC 

(2) INFORMATION FOR SEQ ID NO:258: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pain; 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258: 
ATCTTAGCCA 

(2) INFORMATION FOR SEQ ID NO:259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:259: 
TCTTAGCCAC 

(2) INFORMATION FOR SEQ ID NO:260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:260: 
CTTAGCCACT 

(2) INFORMATION FOR SEQ ID NO:261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26I: 
TTAGCCACTT 

(2) INFORMATION FOR SEQ ID NO:262; 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 
TAGCCACTTT 

(2) INFORMATION FOR SEQ ID NO:263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:263: 
AGCCACTTTT 

(2) INFORMATION FOR SEQ ID NO:264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:264: 
GCCACTTTTT 

(2) INFORMATION FOR SEQ ID NO:265: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:265: 
CCACl'l'liTA 

(2) INFORMATION FOR SEQ ID NO:266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:266: 
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CACTTTTTAA 

(2) INFORMATION FOR SEQ ID NO:267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:267: 
ACTTTTTAAA 

(2) INFORMATION FOR SEQ ID NO:268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:268: 
CTTTTTAAAA 

(2) INFORMATION FOR SEQ ID NO:269: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:269: 
TTTTTAAAAG 

(2) INFORMATION FOR SEQ ID NO:270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:270: 
TTTTAAAAGA 

(2) INFORMATION FOR SEQ ID NO:271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:271: 
TTTAAAAGAA 

(2) INFORMATION FOR SEQ ID NO:272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:272: 
TTAAAAGAAA 

(2) INFORMATION FOR SEQ ID NO:273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:273: 
TAAAAGAAAA 

(2) INFORMATION FOR SEQ ID NO:274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:274: 
AAAAGAAAAG 

(2) INFORMATION FOR SEQ ID NO:275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:275: 

AAAGAAAAGG 
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(2) INFORMATION FOR SEQ ID NO:276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:276: 
AAGAAAAGGG 

(2) INFORMATION FOR SEQ ID NO:277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:277: 
AGAAAAGGGG 

(2) INFORMATION FOR SEQ ID NO:278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:278: 
GAAAAGGGGG 

(2) INFORMATION FOR SEQ ID NO:279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:279: 
AAAAGGGGGG 

(2) INFORMATION FOR SEQ ID NO:280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:280: 
AAAGGGGGGA 

(2) INFORMATION FOR SEQ ID NO:281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:281: 
AAGGGGGGAC 

(2) INFORMATION FOR SEQ ID NO:282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0.282: 
AGGGGGGACT 

(2) INFORMATION FOR SEQ ID NO:283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:283: 
GGGGGGACTG 

(2) INFORMATION FOR SEQ ID NO:284: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:284: 
GGGGGACTGG 

(2) INFORMATION FOR SEQ ID NOS85: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:285: 
GGGGACTGGA 

(2) INFORMATION FOR SEQ ID NO:286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:286: 
GGGACTGGAA 

(2) INFORMATION FOR SEQ ID NO:287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0 287: 
GGACTGGAAG 

(2) INFORMATION FOR SEQ ID NO:288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:288: 
GACTGGAAGG 

(2) INFORMATION FOR SEQ ID N0 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:289: 
ACTGGAAGGO 

(2) INFORMATION FOR SEQ ID NO:290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:290: 
CTGGAAGGGC 

(2) INFORMATION FOR SEQ ID NO:291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:291: 
TGGAAGGGCT 

(2) INFORMATION FOR SEQ ID NO:292: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:292: 
GGAAGGGCTA 

(2) INFORMATION FOR SEQ ID NO:293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:293: 
GAAGGGCTAA 

(2) INFORMATION FOR SEQ ID NO:294: 
(i) SEQUENCE CHARACTERISTICS: 



WO 95/21912 



PCT/AU95/00063 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:294: 
AAGGGCTAAT 

(2) INFORMATION FOR SEQ ID NO:295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:295: 
AGGGCTAATT 

(2) INFORMATION FOR SEQ ID NO:296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:296: 
GGGCTAATTC 

(2) INFORMATION FOR SEQ ID NO:297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:297: 
GGCTAATTCA 

(2) INFORMATION FOR SEQ ID NO:298: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:298; 
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GCTAATTCAC 

(2) INFORMATION FOR SEQ ID NO:299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:299: 
CTAATTCACT 

(2) INFORMATION FOR SEQ ID NO:300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:300: 
TAATTCACTC 

(2) INFORMATION FOR SEQ ID NO:301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30l: 
AATTCACTCC 

(2) INFORMATION FOR SEQ ID NO:302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:302: 
ATTCACTCCC 

(2) INFORMATION FOR SEQ ID NO:303: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:303: 
TTCACTCCCA 

(2) INFORMATION FOR SEQ ID NO:304: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:304: 
TCACTCCCAA 

(2) INFORMATION FOR SEQ ID NO:305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:305: 
CACTCCCAAA 

(2) INFORMATION FOR SEQ ID NO:306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:306: 
ACTCCCAAAG 

(2) INFORMATION FOR SEQ ID NO:307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:307: 
CTCCCAAAGA 



(2) INFORMATION FOR SEQ ID NO:308. 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:308: 
TCCCAAAGAA 

(2) INFORMATION FOR SEQ ID NO:309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:309: 
CCCAAAGAAG 

(2) INFORMATION FOR SEQ ID NO:310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:310: 
CCAAAGAAGA 

(2) INFORMATION FOR SEQ ID NO:3i 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31 1: 
CAAAGAAGAC 

(2) INFORMATION FOR SEQ ID NO:312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:312: 
AAAGAAGACA 

(2) INFORMATION FOR SEQ ID NO:313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:313: 
AAGAAGACAA 

(2) INFORMATION FOR SEQ ID NO:314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:314: 
AGAAGACAAG 

(2) INFORMATION FOR SEQ ID NO:315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:315: 
GAAGACAAGA 

(2) INFORMATION FOR SEQ ID NO:316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:316: 
AAGACAAGAT 

(2) INFORMATION FOR SEQ ED NO:317: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:317: 
AGACAAGATA 

(2) INFORMATION FOR SEQ ID NO:318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE. DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3I8: 
GACAAGATAT 

(2) INFORMATION FOR SEQ ID NO:3l9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:319: 
ACAAGATATC 

(2) INFORMATION FOR SEQ ID NO:320: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:320: 
CAAGATATCC 

(2) INFORMATION FOR SEQ ID NO:321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32I: 
AAGATATCCT 



(2) INFORMATION FOR SEQ ID NO:322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:322: 
AGATATCCTT 



(2) INFORMATION FOR SEQ ID NO:323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:323: 
GATATCCTTG 



(2) INFORMATION FOR SEQ ID NO:324: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:324: 
ATATCCTTGA 



(2) INFORMATION FOR SEQ ID NO:325: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:325 
TATCCTTGAT 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:326: 
ATCCTTGATC 



(2) INFORMATION FOR SEQ ID NO:327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:327: 
TCCTTGATCT 



(2) INFORMATION FOR SEQ ID NO:328: ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ~ ; 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:328: <^ 
CCTTGATCTG 



(2) INFORMATION FOR SEQ ID NO:329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0329: 
CTTGATCTGT 



(2) INFORMATION FOR SEQ ID NO:330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:326: 

(i) SEQUENCE CHARACTERISTICS: 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO330: 
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TTOATCTGTG 

(2) INFORMATION FOR SEQ ID NO:33 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:331: 
TGATCTGTGG 

(2) INFORMATION FOR SEQ ID NO:332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:332: 
GATCTGTGGA 

(2) INFORMATION FOR SEQ ID NO:333; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:333: 
ATCTGTGGAT 

(2) INFORMATION FOR SEQ ID NO:334: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:334: 
TCTGTGGATC 

(2) INFORMATION FOR SEQ ID NO:335: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:335: 
CTGTGGATCT 

(2) INFORMATION FOR SEQ ID NO:336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:336: 
TGTGGATCTA 

(2) INFORMATION FOR SEQ ID NO:337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:337: 
GTGGATCTAC 

(2) INFORMATION FOR SEQ ID NO:338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:338: 
TGGATCTACC 

(2) INFORMATION FOR SEQ ID NO:339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:339: 
GGATCTACCA 
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(2) INFORMATION FOR SEQ ID NO:340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:340: 
GATCTACCAC 



(2) INFORMATION FOR SEQ ID NO:341 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:341: 
ATCTACCACA 



(2) INFORMATION FOR SEQ ID NO:342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:342: 
TCTACCACAC 



(2) INFORMATION FOR SEQ ID NO:343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:343: 
CTACCACACA 



(2) INFORMATION FOR SEQ ID NO:344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:344: 
TACCACACAC 



(2) INFORMATION FOR SEQ ID NO:345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:345: 
ACCACACACA 



(2) INFORMATION FOR SEQ ID NO:346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:346: 

CCACACACAA 

(2) INFORMATION FOR SEQ ID NO:347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:347: 

CACACACAAG 




(2) INFORMATION FOR SEQ ID NO:348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:348: 
ACACACAAGG 



(2) INFORMATION FOR SEQ ID NO:349: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:349: 
CACACAAGGC 

(2) INFORMATION FOR SEQ ID NO:350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:350: 
ACACAAGGCT 

(2) INFORMATION FOR SEQ ID NO:351 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35I: 
CACAAGGCTA 

(2) INFORMATION FOR SEQ ID NO:352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:352: 
ACAAGGCTAC 

(2) INFORMATION FOR SEQ ID N0 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:353: 
CAAGGCTACT 

(2) INFORMATION FOR SEQ ID NO:354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:354: 
AAGGCTACTT 

(2) INFORMATION FOR SEQ ID NO:355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:355: 
AGGCTACTTC 

(2) INFORMATION FOR SEQ ID NO:356: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:356: 
GGCTACTTCC 

(2) INFORMATION FOR SEQ ID NO:357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:357: 
GCTACTTCCC 

(2) INFORMATION FOR SEQ ID NO:358: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(it) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:358 
CTACTTCCCT 



(2) INFORMATION FOR SEQ ID NO:359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:359: 
TACTTCCCTG 



(2) INFORMATION FOR SEQ ID NO:360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:360: 
ACTTCCCTGA 



(2) INFORMATION FOR SEQ ID NO:361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:361: 
CTTCCCTGAT 



(2) INFORMATION FOR SEQ ID NO:362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO :3 62: 
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TTCCCTGATT 



(2) INFORMATION FOR SEQ ID NO:363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE, nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:363: 
TCCCTGATTG 



(2) INFORMATION FOR SEQ ID NO:364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:364: 3& 
CCCTGATTGG 



(2) INFORMATION FOR SEQ ID NO:365: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:365: 
CCTGATTGGC 



(2) INFORMATION FOR SEQ ID NO:366: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:366: 
CTGATTGGCA 



(2) INFORMATION FOR SEQ ID NO:367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:367: 
TGATTGGCAG 

(2) INFORMATION FOR SEQ ID NO:368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:368:. 
GATTGGCAGA 

(2) INFORMATION FOR SEQ ID NO:369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:369: 
ATTGGCAGAA 

(2) INFORMATION FOR SEQ ID NO:370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:370: 
TTGGCAGAAC 

(2) INFORMATION FOR SEQ ID NO:371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:371: 
TGGCAGAACT 



(2) INFORMATION FOR SEQ ID NO:372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:372: 
GGCAGAACTA 

(2) INFORMATION FOR SEQ ID NO:373: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:373: 
GCAGAACTAC 

(2) INFORMATION FOR SEQ ID NO:374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:374: 
CAGAACTACA 

(2) INFORMATION FOR SEQ ID NO:375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;375: 
AGAACTACAC 

(2) INFORMATION FOR SEQ ID NO:376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE; DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:376: 
GAACTACACA 

(2) INFORMATION FOR SEQ ID NO:377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:377: 
AACTACACAC 

(2) INFORMATION FOR SEQ ID NO:378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:378: 
ACTACACACC 

(2) INFORMATION FOR SEQ ID NO:379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:379: 
CTACACACCA 

(2) INFORMATION FOR SEQ ID NO:380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:380: 
TACACACCAG 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:381: 
ACACACCAGG 

(2) INFORMATION FOR SEQ ID NO:382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:382: 
CACACCAGGG 

(2) INFORMATION FOR SEQ ID NO:383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:383: 
ACACCAGGGC 

(2) INFORMATION FOR SEQ ID NO:384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:384: 
CACCAGGGCC 

(2) INFORMATION FOR SEQ ID NO:385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:381: 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:385: 
ACCAGGGCCA 

(2) INFORMATION FOR SEQ ID NO:386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:386: 
CCAGGGCCAG 

(2) INFORMATION FOR SEQ ID NO:387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:387: 

CAGGGCCAGG 

(2) INFORMATION FOR SEQ ID NO:388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:388: 
AGGGCCAGGG 

(2) INFORMATION FOR SEQ ED NO:389: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:389: 

GGGCCAGGGG 

(2) INFORMATION FOR SEQ ID NO:390: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:390: 
GGCCAGGGGT 

(2) INFORMATION FOR SEQ ID NO:391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:391: 
GCCAGGGGTC 

(2) INFORMATION FOR SEQ ID NO:392: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:392: 
CCAGGGGTCA 

(2) INFORMATION FOR SEQ H> NO:393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:393: 
CAGGGGTCAG 

(2) INFORMATION FOR SEQ ID NO:394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:394: 
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AGGGGTCAGA 

(2) INFORMATION FOR SEQ ID NO:395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO:395: 
GGGGTCAGAT 

(2) INFORMATION FOR SEQ ID NO:396: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:396: 
GGGTCAGATA 

(2) INFORMATION FOR SEQ ID NO:397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:397: 
GGTCAGATAT 

(2) INFORMATION FOR SEQ ED NO:398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:398: 

GTCAGATATC 

(2) INFORMATION FOR SEQ ID NO:399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:399: 
TCAGATATCC 

(2) INFORMATION FOR SEQ ID NO:400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:400: 
CAGATATCCA 

(2) INFORMATION FOR SEQ ID NO:401 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(it) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 401: 
AGATATCCAC 

(2) INFORMATION FOR SEQ ID NO:402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:402: 
GATATCCACT 

(2) INFORMATION FOR SEQ ID NO:403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403: 
ATATCCACTG 
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(2) INFORMATION FOR SEQ ID NO:404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pains 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:404: 
TATCCACTGA 

(2) INFORMATION FOR SEQ ID NO:405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 405: 
ATCCACTGAC 

(2) INFORMATION FOR SEQ ID NO:406: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:406: 
TCCACTGACC 

(2) INFORMATION FOR SEQ ID NO:407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:407: 
CCACTGACCT 

(2) INFORMATION FOR SEQ ID NO 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:408: 
CACTGACCTT 

(2) INFORMATION FOR SEQ ID NO:409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:409: 
ACTGACCTTT 

(2) INFORMATION FOR SEQ ID NO:410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:410: 
CTGACCTTTG 

(2) INFORMATION FOR SEQ ID N0 41 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41I: 
TGACCTTTGG 

(2) INFORMATION FOR SEQ ID NO:412: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:412: 
GACCTTTGGA 

(2) INFORMATION FOR SEQ ID NO:4I3: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:413: 
ACCTTTGGAT 

(2) INFORMATION FOR SEQ ID NO:414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4I4: 
CCTTTGGATG 

(2) INFORMATION FOR SEQ ID NO:415: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:415: 
CTTTGGATGG 

(2) INFORMATION FOR SEQ ID NO:416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4l6: 
TTTGGATGGT 

(2) INFORMATION FOR SEQ ID NO:417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4I7: 
TTGGATGGTG 

(2) INFORMATION FOR SEQ ID NO:418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:418: 
TGGATGGTGC 

(2) INFORMATION FOR SEQ ID NO:419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:419: 
GGATGGTGCT 

(2) INFORMATION FOR SEQ ID NO:420: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:420: 
GATGGTGCTA 

(2) INFORMATION FOR SEQ ID NO:421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:421: 
ATGGTGCTAC 

(2) INFORMATION FOR SEQ ID NO:422: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:422: 
TGGTGCTACA 

(2) INFORMATION FOR SEQ ID NO:423: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:423: 
GGTGCTACAA 

(2) INFORMATION FOR SEQ ID NO:424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:424: 
GTGCTACAAG 

(2) INFORMATION FOR SEQ ID NO:425; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDN0 425: 
TGCTACAAGC 

(2) INFORMATION FOR SEQ ID NO:426: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:426: 
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GCTACAAGCT 

(2) INFORMATION FOR SEQ ID NO:427: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:427: 
CTACAAGCTA 

(2) INFORMATION FOR SEQ ID NO:428: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:428: 
TACAAGCTAG 

(2) INFORMATION FOR SEQ ID NO:429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:429: 
ACAAGCTAGT 

(2) INFORMATION FOR SEQ ID NO:430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:430: 
CAAGCTAGTA 

(2) INFORMATION FOR SEQ ID NO:431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:431: 
AAGCTAGTAC 

(2) INFORMATION FOR SEQ ID NO:432: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:432: 
AGCTAGTACC 

(2) INFORMATION FOR SEQ ID NO:433: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:433: 
GCTAGTACCA 

(2) INFORMATION FOR SEQ ID NO:434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:434: 
CTAGTACCAG 

(2) INFORMATION FOR SEQ ID NO:435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:435: 
TAGTACCAGT 



(2) INFORMATION FOR SEQ ID NO:436: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:436: 
AGTACCAGTT 

(2) INFORMATION FOR SEQ ID NO:437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:437: 
GTACCAGTTG 

(2) INFORMATION FOR SEQ ID NO:438: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:438: 
TACCAGTTGA 

(2) INFORMATION FOR SEQ ID NO:439: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:439: 
ACCAGTTGAG 

(2) INFORMATION FOR SEQ ID NO:440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:440: 
CCAGTTGAGC 

(2) INFORMATION FOR SEQ ID NO:44 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:441: 
CAGTTGAGCC 

(2) INFORMATION FOR SEQ ID NO:442: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:442: 
AGTTGAGCCA 

(2) INFORMATION FOR SEQ ID NO:443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:443: 
GTTGAGCCAG 

(2) INFORMATION FOR SEQ ID NO:444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:444: 
TTGAGCCAGA 

(2) INFORMATION FOR SEQ ID NO:445: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:445: 
TGAGCCAGAT 

(2) INFORMATION FOR SEQ ID NO:446: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:446: 
GAGCCAGATA 

(2) INFORMATION FOR SEQ ID NO:447: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:447: 
AGCCAGATAA 

(2) INFORMATION FOR SEQ ID NO:448: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:448: 
GCCAGATAAG 

(2) INFORMATION FOR SEQ ID NO:449: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:449: 
CCAGATAAGG 

(2) INFORMATION FOR SEQ ID NO:450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:450: 
CAGATAAGGT 

(2) INFORMATION FOR SEQ ID NO:451: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:451: 

AGATAAGGTA 

(2) INFORMATION FOR SEQ ID NO:452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:452: 
GATAAGGTAG 

(2) INFORMATION FOR SEQ ID NO:453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:453 
ATAAGGTAGA 

(2) INFORMATION FOR SEQ ID NO:454: 

(i) SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:454: 
TAAGGTAGAA 

(2) INFORMATION FOR SEQ ID NO:455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:455: 
AAGGTAGAAG 

(2) INFORMATION FOR SEQ ID NO:456: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:456: 
AGGTAGAAGA 

(2) INFORMATION FOR SEQ ID NO:457: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:457: 
GGTAGAAGAG 

(2) INFORMATION FOR SEQ ID NO:458: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:458: 
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GTAGAAGAGG 

(2) INFORMATION FOR SEQ ID NO:459: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:459: 

TAGAAGAGGC 

(2) INFORMATION FOR SEQ ID NO:460: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:460: 
AGAAGAGGCC 

(2) INFORMATION FOR SEQ ID NO:46I: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46I: 
GAAGAGGCCA 

(2) INFORMATION FOR SEQ ID NO:462: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION; SEQ ID NO:462: 
AAGAGGCCAA 

(2) INFORMATION FOR SEQ ID NO:463: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:463: 
AGAGGCCAAT 

(2) INFORMATION FOR SEQ ID NO:464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:464: 
GAGGCCAATA 

(2) INFORMATION FOR SEQ ED NO:465: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:465: 
AGGCCAATAA 

(2) INFORMATION FOR SEQ ID NO:466: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:466: 
GGCCAATAAA 

(2) INFORMATION FOR SEQ ID NO:467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:467: 
GCCAATAAAG 
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(2) INFORMATION FOR SEQ ID NO:468: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:468: 
CCAATAAAGG 



(2) INFORMATION FOR SEQ ID NO:469: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TVPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:472: 
TAAAGGAGAG 

(2) INFORMATION FOR SEQ ID NO:473: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:473: 
AAAGGAGAGA 

(2) INFORMATION FOR SEQ ID NO:474: 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:469: 
CAATAAAGGA 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 

(2) INFORMATION FOR SEQ ID NO 470: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:474; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs AAGGAGAGAA 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear (2) INFORMATION FOR SEQ ID NO:475: 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ED NO:470: 
AATAAAGGAG 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:471: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:471: 
ATAAAGGAGA 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:475: 
AGGAGAGAAC 



(2) INFORMATION FOR SEQ ID NO:476: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(it) MOLECULE TYPE: DNA 

(2) INFORMATION FOR SEQ ID NO:472: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:476: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs GGAGAGAACA 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 

(D) TOPOLOGY: linear (2) INFORMATION FOR SEQ ID NO:477: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:477: 
GAGAGAACAC 

(2) INFORMATION FOR SEQ ID NO:478: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:478: 
AGAGAACACC 

(2) INFORMATION FOR SEQ ID NO:479: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:479: 
GAGAACACCA 

(2) INFORMATION FOR SEQ ID NO:480: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:480: 

AGAACACCAG 

(2) INFORMATION FOR SEQ ID NO:48I: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY, linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:481: 
GAACACCAGC 

(2) INFORMATION FOR SEQ ID NO:482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:482: 
AACACCAGCT 

(2) INFORMATION FOR SEQ ID NO:483: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:483: 
ACACCAGCTT 

(2) INFORMATION FOR SEQ ID NO:484: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:484: 
CACCAGCTTG 

(2) INFORMATION FOR SEQ ID NO:485: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:485: 
ACCAGCTTGT 

(2) INFORMATION FOR SEQ ID NO:486: 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0 486: 
CCAGCTTGTT 



(2) INFORMATION FOR SEQ ID NO:487: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION. SEQ ID NO:487: 
CAGCTTGTTA 



(2) INFORMATION FOR SEQ ID NO:488: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:488: 
AGCTTGTTAC 



(2) INFORMATION FOR SEQ ID NO:489: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:489: 
GCTTGTTACA 



(2) INFORMATION FOR SEQ ID NO:490: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:490: 
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CTTGTTACAC 



(2) INFORMATION FOR SEQ ID NO:491: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:49I: 
TTGTTACACC 



(2) INFORMATION FOR SEQ ID NO:492: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single # 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:492: ^ 
TGTTACACCC 



(2) INFORMATION FOR SEQ ID NO:493: $ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single , 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:493: : ^ 
GTTACACCCT ^ 



(2) INFORMATION FOR SEQ ID NO:494: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:494: 
TTACACCCTG 



(2) INFORMATION FOR SEQ ID NO:495: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:495: 
TACACCCTGT 

(2) INFORMATION FOR SEQ ID NO:496: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:496: 
ACACCCTGTG 

(2) INFORMATION FOR SEQ CD NO:497: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:497: 
CACCCTGTGA 

(2) INFORMATION FOR SEQ ID NO:498: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:498: 
ACCCTGTGAG 

(2) INFORMATION FOR SEQ ID NO:499: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:499: 
CCCTGTGAGC 



(2) INFORMATION FOR SEQ ID NO:500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:500: 
CCTGTGAGCC 

(2) INFORMATION FOR SEQ ID NO:50I: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:501: 
CTGTGAGCCT 

(2) INFORMATION FOR SEQ ID NO:502: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:502: 
TGTGAGCCTG 

(2) INFORMATION FOR SEQ ID NO:503: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:503: 
GTGAGCCTGC 

(2) INFORMATION FOR SEQ ID NO:504: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:504: 
TGAGCCTGCA 

(2) INFORMATION FOR SEQ ID NO:505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:505: 
GAGCCTGCAT 

(2) INFORMATION FOR SEQ ID NO:506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:506: 

AGCCTGCATG 

(2) INFORMATION FOR SEQ ID NO:507: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:507: 
GCCTGCATGG 

(2) INFORMATION FOR SEQ ID NO:508: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:508: 
CCTGCATGGA 

(2) INFORMATION FOR SEQ ID NO:509: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:509: 
CTGCATGGAA 

(2) INFORMATION FOR SEQ ID NO:510: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:510: 
TGCATGGAAT 

(2) INFORMATION FOR SEQ ID NO:51 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 1 1: 
GCATGGAATG 

(2) INFORMATION FOR SEQ ED NO:512: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:512: 
CATGGAATGG 

(2) INFORMATION FOR SEQ ID NO:513: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:513: 
ATGGAATGGA 

(2) INFORMATION FOR SEQ ID NO:514: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucicic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:514; 
TGGAATGGAT 

(2) INFORMATION FOR SEQ ID NO:515: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5I5: 
GGAATGGATG 

(2) INFORMATION FOR SEQ ID NO:516: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:516: 
GAATGGATGA 

(2) INFORMATION FOR SEQ ID NO:5 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucicic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5I7: 

AATGGATGAC 

(2) INFORMATION FOR SEQ ID NO:518: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:518: 
ATGGATGACC 

(2) INFORMATION FOR SEQ ID NO:519: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
t 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:519: 
TGGATGACCC 

(2) INFORMATION FOR SEQ ID NO:520: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:520: 
GGATGACCCT 

(2) INFORMATION FOR SEQ ID NO:521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:521: 
GATGACCCTG 

(2) INFORMATION FOR SEQ ID NO:522: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:522: 
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ATGACCCTGA 

(2) INFORMATION FOR SEQ ID NO:523: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:523: 
TGACCCTGAG 

(2) INFORMATION FOR SEQ ID NO:524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:524: 
GACCCTGAGA 

(2) INFORMATION FOR SEQ ID NO:525: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:525: 
ACCCTGAGAG 

(2) INFORMATION FOR SEQ ID NO:526: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH. 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:526: 
CCCTGAGAGA 

(2) INFORMATION FOR SEQ ID NO:527: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 
P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:527: 

CCTGAGAGAG 

(2) INFORMATION FOR SEQ ID NO:528: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:528: 
CTGAGAGAGA 

(2) INFORMATION FOR SEQ ID NO:529: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:529: 

TGAGAGAGAA 

(2) INFORMATION FOR SEQ ID NO:530: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:530: 
GAGAGAGAAG 

(2) INFORMATION FOR SEQ ID NO:53 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:531: 
AGAGAGAAGT 
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(2) INFORMATION FOR SEQ ID NO:532: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:532: 

GAGAGAAGTG 

(2) INFORMATION FOR SEQ ID NO:533: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:533: 
AGAGAAGTGT 

(2) INFORMATION FOR SEQ ID NO:534: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:534: 
GAGAAGTGTT 

(2) INFORMATION FOR SEQ ID NO:535: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:535: 
AGAAGTGTTA 

(2) INFORMATION FOR SEQ ID NO:536: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:536: 
GAAGTGTTAG 

(2) INFORMATION FOR SEQ ID NO:537: 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:537: 
AAGTGTTAGA 

(2) INFORMATION FOR SEQ ID NO:538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:538: 
AGTGTTAGAG 

(2) INFORMATION FOR SEQ ID NO:539: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:539: 
GTGTTAGAGT 

(2) INFORMATION FOR SEQ ID NO:540: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:540: 
TGTTAGAGTG 

(2) INFORMATION FOR SEQ ID NO:541: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:541 
GTTAGAGTGG 



(2) INFORMATION FOR SEQ ID NO:542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:542: 
TTAGAGTGGA 



(2) INFORMATION FOR SEQ ID NO:543: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO S43: 
TAGAGTGGAG 



(2) INFORMATION FOR SEQ ID NO:544: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:544 
AGAGTGGAGG 



(2) INFORMATION FOR SEQ ID NO:545: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:545: 
GAGTGGAGGT 



(2) INFORMATION FOR SEQ ID NO:546: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:546: 
AGTGGAGGTT 



(2) INFORMATION FOR SEQ ID NO:547: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:547; 
GTGGAGGTTT 



(2) INFORMATION FOR SEQ ID NO:548: 

(i) SEQUENCE CHARACTERISTICS: ^ 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA * 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:548. 

TGGAGGTTTG 



(2) INFORMATION FOR SEQ ID NO:549: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:549: 
GGAGGTTTGA 



(2) INFORMATION FOR SEQ ID NO:550: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:550: 
GAGGTTTGAC 

(2) INFORMATION FOR SEQ ID NO:551: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:551: 
AGGTTTGACA 

(2) INFORMATION FOR SEQ ID NO:552: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:552: 
GGTTTGACAG 

(2) INFORMATION FOR SEQ ID NO:553: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ IDNO:553: 

GTTTGACAGC 

(2) INFORMATION FOR SEQ ID NO:554: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:554: 
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TTTGACAGCC 

(2) INFORMATION FOR SEQ ID NO:555: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:555: 
TTGACAGCCG 

(2) INFORMATION FOR SEQ ID NO:556: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:556: 
TGACAGCCGC 

(2) INFORMATION FOR SEQ ID NO:557: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:557: 
GACAGCCGCC 

(2) INFORMATION FOR SEQ ID NO:558: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:558: 
ACAGCCGCCT 

(2) INFORMATION FOR SEQ ID NO:559: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:559: 
CAGCCGCCTA 

(2) INFORMATION FOR SEQ ID NO:560: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:560: 
AGCCGCCTAG 

(2) INFORMATION FOR SEQ ID NO:561: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:561: 
GCCGCCTAGC 

(2) INFORMATION FOR SEQ ID NO:562: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:562: 
CCGCCTAGCA 

(2) INFORMATION FOR SEQ ID NO:563: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:563: 
CGCCTAGCAT 



(2) INFORMATION FOR SEQ ID NO:564: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564: 
GCCTAGCATT 

(2) INFORMATION FOR SEQ ID NO:565: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:565: 
CCTAGCATTT 

(2) INFORMATION FOR SEQ ID NO:566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ED NO:566: 
CTAGCATTTC 

(2) INFORMATION FOR SEQ ID NO:567: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:567: 
TAGCATTTCA 

(2) INFORMATION FOR SEQ ID NO:568: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:568: 
AGCATTTCAT 

(2) INFORMATION FOR SEQ ID NO:569: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:569: 
GCATTTCATC 

(2) INFORMATION FOR SEQ ID NO:570: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:570: 
CATTTCATCA 

(2) INFORMATION FOR SEQ ID NO:571 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:571: 
ATTTCATCAC 

(2) INFORMATION FOR SEQ ID NO:572: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:572: 
TTTCATCACG 

(2) INFORMATION FOR SEQ ID NO:573: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:573: 
TTCATCACGT 

(2) INFORMATION FOR SEQ ID NO:574: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:574: 
TCATCACGTG 

(2) INFORMATION FOR SEQ ID NO:575: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:575: 
CATCACGTGG 

(2) INFORMATION FOR SEQ ID NO:576: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:576: 
ATCACGTGGC 

(2) INFORMATION FOR SEQ ID NO:577: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO;577: 
TCACGTGGCC 

(2) INFORMATION FOR SEQ ID NO:578: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:578: 
CACGTGGCCC 

(2) INFORMATION FOR SEQ ID NO:579: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:579: 

ACGTGGCCCG 

(2) INFORMATION FOR SEQ ID NO:580: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:580: 
CGTGGCCCGA 

(2) INFORMATION FOR SEQ ID NO:58 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58I: 
GTGGCCCGAG 

(2) INFORMATION FOR SEQ ID NO:582: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:582: 
TGGCCCGAGA 

(2) INFORMATION FOR SEQ ID NO:583: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:583: 
GGCCCGAGAG 

(2) INFORMATION FOR SEQ ID NO:584: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:584: 
GCCCGAGAGC 

(2) INFORMATION FOR SEQ ID NO:585: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:585: 
CCCGAGAGCT 

(2) INFORMATION FOR SEQ ID NO:586: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:586: 
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CCGAGAGCTG 

(2) INFORMATION FOR SEQ ID NO:587: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:587: 
CGAGAGCTGC 

(2) INFORMATION FOR SEQ ID NO:588: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:588: 
GAGAGCTGCA 

(2) INFORMATION FOR SEQ ID NO:589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:589: 
AGAGCTGCAT 

(2) INFORMATION FOR SEQ ID NO:590: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:590: 

GAGCTGCATC 

(2) INFORMATION FOR SEQ ID NO:591: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:591: 
AGCTGCATCC 

(2) INFORMATION FOR SEQ ID NO:592: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:592: 
GCTGCATCCG 

(2) INFORMATION FOR SEQ ID NO:593: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:593: 
CTGCATCCGG 

(2) INFORMATION FOR SEQ ID NO:594: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:594: 
TGCATCCGGA 

(2) INFORMATION FOR SEQ ID NO:595: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:595: 
GCATCCGGAG 
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(2) INFORMATION FOR SEQ ID NO:596: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:596: 
CATCCGGAGT 

(2) INFORMATION FOR SEQ ID NO:597: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:597: 
ATCCGGAGTA 

(2) INFORMATION FOR SEQ ID NO:598: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:598: 
TCCGGAGTAC 

(2) INFORMATION FOR SEQ ID NO:599: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:599: 
CCGGAGTACT 

(2) INFORMATION FOR SEQ ID NO:600: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:600: 
CGGAGTACTT 

(2) INFORMATION FOR SEQ ID NO:601: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:601: 
GGAGTACTTC 

(2) INFORMATION FOR SEQ ID NO:602: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:602: 
GAGTACTTCA 

(2) INFORMATION FOR SEQ ID NO:603: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:603: 
AGTACTTCAA 

(2) INFORMATION FOR SEQ ID NO:604: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:604: 
GTACTTCAAG 

(2) INFORMATION FOR SEQ ID NO:605: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:605: 
TACTTCAAGA 

(2) INFORMATION FOR SEQ ID NO:606: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 606: 
ACTTCAAGAA 

(2) INFORMATION FOR SEQ ID NO:607: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:607: 
CTTCAAGAAC 

(2) INFORMATION FOR SEQ ID NO:608: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:608: 
TTCAAGAACT 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:609: 
TCAAGAACTG 

(2) INFORMATION FOR SEQ ID NO:6I0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:610: 
CAAGAACTGC 

(2) INFORMATION FOR SEQ ID NO:61 1: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61 1: 
AAGAACTGCT 

(2) INFORMATION FOR SEQ ID NO:612: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:612: 
AGAACTGCTG 



(2) INFORMATION FOR SEQ ID NO:613: 

(i) SEQUENCE CHARACTERISTICS: 
(2) INFORMATION FOR SEQ ID NO:609: (A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(i) SEQUENCE CHARACTERISTICS: (C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6I3: 
GAACTGCTGA 
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(2) INFORMATION FOR SEQ ID NO: 6 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614: 



GAAGAGATTT 


GGGAGAACAT 


GACCTGGATG 


CAGTGGGAAA 


AAGAAATTCA 


CAATCACACA 


60 


AAATACATAT 


ACTCCTTACT 


TGAAAAATCG 


CAGAACCAAC 


AAGAAAAGAA 


TGAACAAGAA 


120 


CTATTGGAAT 


TGGATCAATG 


GGCAAGTTTG 


TGGAATTGGT 


TTGACATAAC 


AAAATGG CTG 


180 


TGGTATATAA 


AAATATTCAT 


AATGG TAG T A 


GGAGGCTTGA 


TAGG TTTAAG 


AATAGTTTTT 


240 


GCTGTACTTT 


CTATAGTGAA 


TAGAGTTAGG 


CAGGGATACT 


CACCATTGTC 


GTTTCAGACC 


300 


CTCCTCCCAA 


CCCCGAGGGG 


ACCCGACAGG 


CCCGAAGGAA 


TCGAAGAAGA 


AGGTGGAGAG 


360 


AGAGACAGAG 


ACAGATCCAC 


TCGATTAGTA 


CACGGATTCT 


TAGCACTTTT 


CTGGGACGAC 


420 


CTGAGGAGCC 


TGTGCCTCTT 


CCTCTACCAC 


CACTTGAGAG 


ACTTACTCTT 


GATTGTAACA 


480 


AGGATTGTGG 


AACTTCTGGG 


ACG CAGGGG A 


TGGGAAGCCC 


TCAAATATTG 


GTGGAACCTC 


540 


CTAAAGTATT 


GGAGCCAGGA 


ACTGCAGAAG 


AGTGCTGTTA 


TCTTGCTCAA 


TGCCACCGCC 


600 


ATAGCAGTAG 


CTGAGGGGAC 


AGATAGAGTT 


TTAGAAGTAT 


TACAAAGAGC 


T TAT AG AG CT 


660 


ATCCTCCACA 


T AC C TAG AAG 


AATAAGACAG 


GGCCTCGAAA 


TGGCTTTGCT 


ATAAAATGGG 


720 


TGGCAAGTGA 


GCAAAAAGTA 


GTGTAGTCAG 


AT AG CATG CA 


TCATAAGGGG 


TGGGGGCCAA 


780 


CAACTAACAA 


TGCTGATCGT 


GCCTGGC TAG 


AAG CACAAGA 


GAAGGAAGAA 


GCGGGTTTTC 


840 


CAGTCAAACC 


TCAGGTAGCT 


GTAGATCTTA 


GCCACTTTTT 


AAAAGAAAAG 


GGGGGACTGG 


900 


AAGGG CTAAT 


TCACTCCCAA 


AGAAGACAAG 


ATACACAGTG 


CTGCAAACTA 


TTACCAGTGG 


960 


AGTCAGCGAA 


GATAGAAGAG 


GCCAATGGAG 


GAGAAAACCA 


CAGATTGTTC 


TGTTGGGGAC 


1020 


TTTCCATCCG 


TTGGGGACTT 


TCCAAGGCGG 


CGTGGCCTGG 


GTGACTAGTT 


CCGGTGGGGA 


1080 


CTTTCCAAGA 


AGGCGCGGCC 


TGGGCGGGAC 


TGGGGAGTGG 


CGAGCCCTCA 


G ATG CTG CAT 


1140 


ATAAGCAGCT 


GCTTTCTGCT 


GTTACTGGGT 


CTCTCGGGTT 


AGACCAGATC 


TGAGC CTGGG 


12 00 


AGCTCTCTGG 


CTAACTAGGG 


AACCCACTGC 


TTAAG CCTCA 


ATAAAGCTTG 


CCTTGAGTGC 


1260 


TTCAAGTAGT 


GTGTGCCCGT 


CTGTTGTGTG 


ACTCTGGTAT 


CTAGA 




1305 
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(2) INFORMATION FOR SEQ ID NO: 615: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615: 



GAAACAATTT 


GGGATAACAT 


GACCTGGATG 


CAGTGGGAAA 


GAGAAATTGA 


CAATTACACA 


60 


AACATAATAT 


ACACCTTAAT 


TGAAGAAT CG 


CAGAACCAAC 


AAGAAAAAAA 


TGAACTAGAA 


120 


TTATTGGAAT 


TGGATAAATG 


GG CAAATTTG 


TGGAATTGGT 


TTAGTATATC 


AAACTGGCTA 


180 


TGGTATATAA 


AATTATTCAT 


AATGGTAGTA 


GGAGG CTTGG 


TAGGTTTAAG 


AATAGTTTTT 


240 


ACTGTACTTT 


CTATAGTGAA 


TAGAGTTAGG 


CAGGGATACT 


CACCATTGTC 


GTTTCAGACC 


300 


CACCTCCCAA 


CCCCGAAGGG 


ACCCGACAGG 


CCAGAAGGAA 


TCGAAGAAGA 


AGGTGGAGAG 


360 


AGAGACAGAG 


GCAGCTCCAC 


TCGATTAGTG 


CACGGATTCT 


TAGCACTTTT 


CTGGGACGAC 


420 


CTGAGGAGTC 


TGTGCCTCTT 


CAGCTACCAC 


CACTTGAGAG 


ACTTACTCTT 


GATTGTAACG 


480 


AGGATTGTGG 


AACTTCTGGG 


ACG CAGGGG A 


TGGGAAGCCC 


TCAAATACTG 


GTGGAATCTC 


540 


CTGCAGTATT 


GGAGG CAGG A 


ACTACAGAAG 


AGTGCTGTTA 


GCTTGTTCAA 


TGGCACGGCC 


600 


ATAGCAGTAG 


CTGAGGGGAC 


AGATAGAGTT 


ATAGAAGCTT 


TACGAAGGGC 


TTATAGAGCT 


660 


ATTCTCCACA 


TACCTAGAAG 


AATAAGACAG 


GGCTTAGAAA 


GGGCTTTGCT 


ATAAAATGGG 


720 


TGGCAAGTGG 


TCAGAAAGTA 


GTGTGGTTAG 


AAGGCATGTA 


CCTTTAAGAC 


AAGGCAGCTA 


780 


TAGATCTTAG 


CCGCTTTTTA 


AAAGAAAAGG 


GGGGACTGGA 


AGGGCTAATT 


CACTCACAGA 


840 


GAAGATCAGT 


TGAACCAGAA 


GAAGATAGAA 


GAGGCCATGA 


AGAAGAAAAC 


AACAGATTGT 


900 


TCCGTTTGTT 


CCGTTGGGGA 


CTTTCCAGGA 


GACGTGGCCT 


GAGTGATAAG 


CCGCTGGGGA 


960 


CTTTCCGAAG 


AGGCGTGACG 


GGACTTTCCA 


AGGCGACGTG 


GCCTGGGCGG 


GACTGGGGAG 


1020 


TGGCGAGCCC 


TCAGATGCTG 


CATATAAGCA 


GCTGCTTTCT 


GCCTGTACTG 


GGTCTCTCTG 


1080 


GTTAGACCAG 


ATCTGAGCCT 


GGGAGCTCTC 


TGGCTAACTA 


GGGAACCCAC 


TGCTTAAGCC 


1140 


TCAATAAAGC 


TTGCCTTGAG 


TGCTTCAAGT 


AGTGTGTGCC 


CGTCTGTTGT 


GTGACTCTGG 


1200 



TATCTAGA 
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(2) INFORMATION FOR SEQ ID NO:6l6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:616: 
TGG AAGGGCT AATTTGGT 1 g 



(2) INFORMATION FOR SEQ ID NO:6 17:: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:617: 
ATCTTCCCTA AAAAATTAGC CTGTC 25 



(2) INFORMATION FOR SEQ ID NO:6I8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:618: 
AGGCTCAGAT CTGGTCTAAC 20 



(2) INFORMATION FOR SEQ ID NO:619: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:619: 
AGCAGCAGGA AGCACTATGG 20 
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(2) INFORMATION FOR SEQ ID NO:620: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:620: 
TGCTAGAGAT TTTCCACAC 19 



(2) INFORMATION FOR SEQ ID NO:621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:621: 
AGTGAATAGA GTTAGGCAGG 20 



(2) INFORMATION FOR SEQ ID NO:622: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:622: 
GTAAGACAGT ATGATCAGAT A 21 



(2) INFORMATION FOR SEQ ID NO:623: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:623: 



TTGTAGGGAA TTCCAAATTC C 



21 



WO 95/21912 



PCT/AU95/00063 



- 152 - 



(2) INFORMATION FOR SEQ ID NO:624: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:624: 
CAGGATCCTA CACCTGTCAA CATAAT 26 



(2) INFORMATION FOR SEQ ID NO:625: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:625: 
GGGAATTCCT TATTCCTGCT TG 22 



(2) INFORMATION FOR SEQ ID NO:626: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:626: 
CCAGAAGTTC CACAATCC 18 



(2) INFORMATION FOR SEQ ID NO:627: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:627: 
TTCTTCTAGG TATGTGGAG 
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(2) INFORMATION FOR SEQ ID NO:628: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:628: 
AGTGAATTAG CCCTTCCAG 19 



(2) INFORMATION FOR SEQ ID NO:629: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:629: 
TGCTAGAGAT TTTCCACAC 19 



(2) INFORMATION FOR SEQ ID NO:630: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:630: 
TGCTCTGGAA AACTCAT 17 



(2) INFORMATION FOR SEQ ID NO:631: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:631: 



CTTTCTATAG TGAATAGAG 



19 
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(2) INFORMATION FOR SEQ ID NO: 6 32: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:.632: 
TATTGGAGTC AGGAACT 17 



(2) INFORMATION FOR SEQ ID NO:633: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:633: 
GGTCTAACCA GAGAGAC 17 



(2) INFORMATION FOR SEQ ID NO: 634: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:634: 

Ala Val Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala 
15 10 



(2) INFORMATION FOR SEQ ID NO: 63 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635: 

Pro Thr Ser Gin Ser Arg Gly Asp Pro Thr Gly Pro Lys Glu Lys 
1 5 10 15 

Lys Lys Val Glu Arg Glu Thr Glu Thr Asp Pro Phe Asp Thr 
20 25 30 

Asp Pro His 
35 
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{2} INFORMATION FOR SEQ ID NO: 63 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) 


MOLECULE TYPE : protein 












(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO 


636 










Pro 
1 


Ser Ser Gin Pro Arg Gly Asp Pro 
5 


Thr 
10 


Gly 


Pro Lys 


Glu 


Ser Lys 
15 


Lys 


Lys Val Glu Arg Glu Thr Glu Thr 
20 25 


Asp 


Pro 


Leu Asp 


Tyr 

30 


Thr 


Asp 


Ser His 
35 













(2) INFORMATION FOR SEQ ID NO: 63 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:637: 

Pro Thr Ser Gin Pro Arg Arg Asp Pro Thr Gly Gin Lys Glu Ser Lys 
1 ■ 5 10 15 

Lvs Lys Val Glu Arg Glu Thr Glu Ala Ala Pro Leu Asp Cys Thr 
20 25 30 

Asp Ser His 
35 



(2) INFORMATION FOR SEQ ID NO: 63 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 8: 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg 
! * 5 10 15 

Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin He His Ser He Ser Glu 
20 25 30 

Arg He Leu Ser Thr Tyr Leu Gly Arg Ser Ala Glu Pro Val Pro Leu 
35 40 45 

Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Asn Glu Asp Cys 
50 55 60 

Glv Thr Ser Gly Thr Gin Gly Val Gly Ser Pro Gin He Leu Val Glu 
SS 70 75 80 
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Ser Pro Thr-Val Leu Glu Ser Gly Thr Lys Glu Cys Cys Leu 

85 90 95 

Ala Gin Cys His Ser His Ser Ser Ser Gly Asp Arg 

100 105 no 



(2) INFORMATION FOR SEQ ID NO: 63 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 63 9: 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg 
15 10 15 

Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin He His Ser He Ser Thr 
20 25 30 

Arg He Leu Ser Thr Phe Leu Gly Arg Pro Glu Glu Pro Val Pro Leu 
35 40 45 

Pro Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Asn Lys Aep Cys 
50 55 go 

Gly Thr Ser Gly Thr Gin Gly Met Gly Ser Pro Gin He Leu Val Glu 
65 7 0 75 80 

Pro Pro Lys Val Leu Glu Pro Gly Thr Ala Glu Glu Cys Cys Tyr Leu 
85 90 95 

Ala Gin Cys His Arg His Ser Ser Ser Gly Asp Arg 

100 105 no 



(2) INFORMATION FOR SEQ ID NO: 64 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640: 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg 
1 5 10 15 

Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin Leu His Ser He Ser Ala 
20 25 30 

Arg He Leu Ser Thr Phe Leu Gly Arg Pro Glu Glu Ser Val Pro Leu 
35 40 45 

Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Asn Glu Asp Cvs 
50 55 60 

Gly Thr Ser Gly Thr Gin Gly Met Gly Ser Pro Gin He Leu Val Glu 
65 70 75 80 

Ser Pro Ala Val Leu Glu Ala Gly Thr Thr Glu Glu Cys Cys Leu 
85 90 95 
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Val Gin Trp His Gly His Ser Ser Ser Gly Asp Arg 
100 105 



110 



(2) INFORMATION FOR SEQ ID NO: 641: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641: 

Glu Gin He Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu He 
15 10 15 

Asn Asn Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn 
20 25 30 

Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala 
35 40 45 

Ser Leu Trp Asn Trp Phe Asn He Thr Asn Trp Leu Trp Tyr He Lys 
50 55 60 

Leu Phe He Met He Val Gly Gly Leu Val Gly Leu Arg He Val Phe 
65 70 75 80 

Ala Val Leu Ser He Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu 
85 90 95 

Ser Phe Gin Thr His Leu Pro He Pro Arg Gly Pro Asp Arg Pro Glu 
100 105 HO 

Gly He Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser He Arg 
115 120 125 

Leu Val Asn Gly Ser Leu Ala Leu He Trp Asp Asp Leu Arg Ser Leu 
130 135 140 

Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu He Val Thr 
145 150 155 160 

Arg He Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr 
165 170 175 

Trp Trp Asn Leu Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala 
180 185 190 

Val Asn Leu Leu Asn Ala Thr Ala He Ala Val Ala Glu Gly Thr Asp 
195 200 205 

Arg Val He Glu Val Leu Gin Ala Ala Tyr Arg Ala He Arg His He 
210 215 220 

Pro Arg Arg He Arg Gin Gly Leu Glu Arg He Leu Leu 
225 230 235 
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(2) INFORMATION FOR SEQ ID NO: 642: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642: 

Glu Glu lie Trp Glu Asn Met Thr Trp Met Gin Trp Glu Lys Glu lie 
1 5 10 is 

His Asn His Thr Lys Tyr lie Tyr Ser Leu Leu Glu Lys Ser Gin Asn 
20 25 30 

Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Gin Trp Ala 
35 40 45 

Ser Leu Trp Asn Trp Phe Asp lie Thr Lys Trp Leu Trp Tyr He Lys 
50 55 60 

He Phe He Met Val Val Gly Gly Leu He Gly Leu Arg He Val Phe 
65 70 75 80 

Ala Val Leu Ser He Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu 
85 90 95 

Ser Phe Gin Thr Leu Leu Pro Thr Pro Arg Gly Pro Asp Arg Pro Glu 
100 105 110 

Gly He Glu Glu Met Gly Gly Glu Arg Asp Arg Asp Arg Ser Thr Arg 
115 120 125 

Leu Val His Gly Phe Leu Ala Leu Phe Trp Asp Asp Leu Arg Ser Leu 
130 135 140 

Cys Leu Phe Leu Tyr His His Leu Arg Asp Leu Leu Leu He Val Thr 
145 150 155 160 

Arg He Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr 
165 170 175 

Trp Trp Asn Leu Leu Lys Tyr Trp Ser Gin Glu Leu Gin Lys Ser Ala 
180 185 190 

Val He Leu Leu Asn Ala Thr Ala He Ala Val Ala Glu Gly Thr Asp 
195 200 205 

Arg Val Leu Glu Val Leu Gin Arg Ala Tyr Arg Ala He Leu His He 
210 215 220 

Pro Arg Arg He Arg Gin Gly Leu Glu Met Ala Leu Leu 
225 230 235 
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(2) INFORMATION FOR SEQ ID NO:643: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 43: 

Glu Thr He Trp Asp Asn Met Thr Trp Met Gin Trp Glu Arg Glu He 
15 10 15 

Asp Asn Tyr Thr Asn He He Tyr Thr Leu lie Glu Glu Ser Gin Asn 
20 25 30 

Gin Gin Glu Lys Asn Glu Leu Glu Leu Leu Glu Leu Asp Lys Trp Ala 
35 40 45 

Asn Leu Trp Asn Trp Phe Ser He Ser Asn Trp Leu Trp Tyr He Lys 
50 55 60 

Leu Phe He Met Val Val Gly Gly Leu Val Gly Leu Arg He Val Phe 
65 70 75 80 

Thr Val Leu Ser lie Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu 
85 90 95 

Ser Phe Gin Thr His Leu Pro Thr Pro Lys Gly Pro Asp Arg Pro Glu 
100 105 HO 

Gly lie Glu Glu Glu Gly Gly Glu Arg Asp Arg Gly Ser Ser Thr Arg 
115 120 125 

Leu Val His Gly Phe Leu Ala Leu Phe Trp Asp Asp Leu Arg Ser Leu 
130 135 140 

Cys Leu Phe Ser Tyr His His Leu Arg Asp Leu Leu Leu He Val Thr 
145 150 155 160 

Arq He Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr 
165 170 175 

Trp Trp Asn Leu Leu Gin Tyr Trp Arg Gin Glu Leu Gin Lys Ser Ala 
180 185 190 

Val Ser Leu Phe Asn Gly Thr Ala He Ala Val Ala Glu Gly Thr Asp 
195 200 205 

Arg Val He Glu Ala Leu Arg Arg Ala Tyr Arg Ala He Leu His He 
210 215 220 

Pro Arg Arg He Arg Gin Gly Leu Glu Arg Ala Leu Leu 



225 



230 



235 
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(2) INFORMATION FOR SEQ ID NO: 644: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 07 amino acids 
(BJ TYPE: amino acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 

Met Gly Gly Lys Trp Ser Lys Ser Ser Val He Gly Trp Pro Ala Val 
1 5 io . 15 

Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala 
20 25 30 

Val Ser Arg Asp Leu Glu Lys His Gly Ala He Thr Ser Ser Asn Thr 
35 40 45 

Ala Ala Asn Asn Ala Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu 
50 55 60 

Glu Val Gly Phe Pro Val Thr Pro Gin Val Pro Leu Arg Pro Met Thr 
65 70 * 75 80 

Tyr Lys Ala Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly 
85 90 95 

Leu Glu Gly Leu He His Ser Gin Arg Arg Gin Asp He Leu Asp Leu 
100 105 HO 

Trp He Tyr His Thr Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr 
115 120 125 

Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Tyr Lys 
130 135 140 

Leu Val Pro Val Glu Pro Asp Lys Val Glu Glu Ala Asn Lys Gly Glu 
145 150 155 160 

Asn Thr Ser Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp Pro 
165 170 175 

Glu Arg Glu Val Leu Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His 
180 185 190 

His Val Ala Arg Glu Leu His Pro Glu Tyr Phe Lys Asn Cys 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 64 5: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 5: 

Met Gly Gly Lys Ala Lys Ser Ser Val Val Arg His Ala Ser 
1 5 10 15 



Gly Val Gly Ala Asn Asn Gin Cys 

20 25 
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(2) INFORMATION FOR SEQ ID NO: 646: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646: 

Met Gly Gly Lys Trp Ser Glu Ser Ser Val Val Arg Arg His Val Pro 
1 5 10 15 

Leu Arg Gin Gly Ser Tyr Arg Ser Pro Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 647; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647: 



CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCTGGGCG GGACTGGGGA 60 
GTGGCGAGCC CTCA 74 



(2) INFORMATION FOR SEQ ID NO: 648: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:648: 



CTGTTGGGGA CTTTCCATCC GTTGGGGACT TTCCAAGGCG GCGTGGCCTG GGTGACTAGT 60 
TCCGGTGGGG ACTTTCCA 78 



(2) INFORMATION FOR SEQ ID NO: 64 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649: 



CCGTTTGTTC CGTTGGGGAC TTTCCAGGAG ACGTGGCCTG AGTGACTAAG CCGCTGGGGA 6 0 

CTTTCCG 6 7 
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(2) INFORMATION FOR SEQ ID NO: 650: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : • DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650: 

ATGGGTGGCA AGTGGTCAAA AAGTAGTGTG ATTGGATGGC CTGCTGTAAG GGAAAGAATG 6 0 

AGACGAGCTG AGCCAGCAGC AGATGGGGTG GGAG CAGTAT CTCGAGACCT AGAAAAACAT 12 0 

GGAGCAATCA CAAGTAGCAA TACAGCAGCT AACAATGCTG CTTGTGCCTG GCTAGAAGCA .180 

CAAGAGGAGG AAGAGGTGGG TTTTCCAGTC ACACCTCAGG TACCTTTAAG ACCAATGACT 24 0 

TACAAGGCAG CTGTAGATCT TAGCCACTTT TTAAAAGAAA AGGGGGGACT GGAAGGGCTA 300 

ATTCACTCCC AAAGAAGACA AGATATCCTT GATCTGTGGA TCTACCACAC ACAAGGCTAC 3 60 

TTCCCTGATT GGCAGAACTA CACACCAGGG CCAGGGGTCA GATATCCACT GACCTTTGGA 420 

TGGTGCTACA AGCTAGTACC AGTTGAGCCA GATAAGG TAG AAGAGGCCAA TAAAGGAGAG 4 80 

AACAC CAGCT TGTTACACCC TGTGAGCCTG CATGGAATGG ATGACCCTGA GAGAGAAGTG 54 0 

TTAGAGTGGA GGTTTGACAG CCGCCTAGCA TTT CATCACG TGGCCCGAGA GCTGCATCCG 600 

GAGTACTTCA AGAACTGCTG A 621 

(2) INFORMATION FOR SEQ ID NO: 6 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651: 



GAACAGATTT 


GG AATAACAT 


GACCTGGATG 


GAGTGGGACA 


GAGAAATTAA 


CAATTACACA 


60 


AG CT TAATAC 


ACTCCTTAAT 


TGAAGAATCG 


CAAAACCAGC 


AAGAAAAGAA 


TGAACAAGAA 


120 


TTATTGGAAT 


TAGATAAATG 


GGCAAGTTTG 


TGGAATTGGT 


TTAACATAAC 


AAATTGGCTG 


180 


TGGTATATAA 


AATTATTCAT 


AATGATAGTA 


GGAGGCTTGG 


TAGG TTTAAG 


AATAGTTTTT 


240 


GCTGTACTTT 


CTATAGTGAA 


TAGAGTTAGG 


CAGGGATATT 


CACCATTATC 


GTTTCAGACC 


300 


CACCTCCCAA 


TCCCGAGGGG 


ACCCGACAGG 


CCCGAAGGAA 


TAGAAGAAGA 


AGGTGGAGAG 


360 


AGAGACAGAG 


ACAGATCCAT 


TCGATTAGTG 


AACGGATCCT 


TAGCACTTAT 


CTGGGACGAT 


420 


CTGCGGAGCC 


TGTGCCTCTT 


CAGCTACCAC 


CG CTTG AG AG 


ACTTACTCTT 


GATTGTAACG 


480 


AGGATTGTGG 


AACTTCTGGG 


ACGCAGGGGG 


TGGGAAGCCC 


TCAAATATTG 


GTGGAATCTC 


540 


CTACAGTATT 


GGAGTCAGGA 


ACTAAAGAAT 


AGTG CTGTT A 


ACTTGCTCAA 


TGCCACAGCC 


600 


ATAGCAGTAG 


CTGAGGGGAC 


AGATAGGGTT 


ATAGAAGTAT 


TACAAGCAGC 


TTATAGAGCT 


660 


ATTCGCCACA 


TACCTAGAAG 


AATAAGACAG 


GGCTTGGAAA 


GG ATTTTGCT 


ATAAGATGGG 


720 


TGG CAAGTGG 


TCAAAAAGTA 


GTGTGATTGG 


ATGGCCTGCT 


GTAAGGGAAA 


GAATGAGACG 


780 
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AG C TG AG C CA 


VjUAvjUAIjA 1 \j 


nrzrzTczacz'h.ar' 

uOU 1 uuvjnuL 


a c^T a t PT PPr A 


G AC PT AG AAA 


AACATGGAGC 


840 




AA 1 LALAAtj 1 




paopta araa 


TP.PTP»PTTGT 


GCCTGGCTAG 


AAGCACAAGA 


900 




Gb Atj Lj A(j 


o 1 buu 1 1 I 1L 


r 1 arTp a p a p p 


TPAfXf^TAPPT 


T TAAG AC CAA 


TGACTTACAA 


960 




(j<jvJALjL_ 1 L» 1 A 


UA1 L. 1 I AuLL 


ft PTTTTT AAA 
KL 11X1 1 AAA 




GGACTG G AAG 


GGCTAATTCA 


1020 




CT L. C LAAAbA 


AoALAAbA 1 A 


1 1_ V* 1 1 WjA 1 \— 1 


TTrraTfTar 

Lj 1 vjvjrt 1 \~ 1 MVk- 


PAPA PAPA AG 


GCTACTTCCC 


1080 




ibAI HjVjL-AVj 


AAL. 1 /iLALAL 




rtnTpaf^aTAT 

VJO 1 k»>\LJ/\ 1 .rV 1 


PCACTGACCT 


TTGGATGGTG 


1140 








AGP PAG AT AA 


GGTAGAAGAG 


GC CAATAAAG 


GAGAGAACAC 


1200 




CAGCTTGTTA 


CACCCTGTGA 


GCCTGCATGG 


AATGGATGAC 


CCTGAGAGAG 


AAGTG TTAGA 


1260 




GTGGAGGTTT 


GACAGCCGCC 


TAGCATTTCA 


TCACGTGGCC 


CGAGAGCTGC 


ATCCGGAGTA 


lizU 




CTTCAAGAAC 


TGCTGACATC 


GAGCTTGCTA 


CAAGGGACTT 


TCCGCTGGGG 


ACTTTCCAGG 


1380 




GAGGCGTGGC 


CTGGGCGGGA 


CTGGGGAGTG 


GCGAGCCCTC 


AGATGCTGCA 


TATAAGCAGC 


1440 




TGCTTTTTGC 


CTGTACTGGG 


TCTCTCTGGT 


TAGACCAGAT 


CTGAGCCTGG 


GAGCTCTCTG 


1500 




GCTAACTAGG 


GAACCCACTG 


CTTAAGCCTC 


AATAAAGCTT 


GCCTTGAGTG 


CTTCAAGTAG 


1560 




TGTGTGCCCG 


TCTGTTGTGT 


GACT CTGGTA 


ACTAGA 






1596 
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(2) INFORMATION FOR SEQ ID NO:652: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:652: 
GCTTTTTGCC 



(2) INFORMATION FOR SEQ ID NO:653: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0.653: 
CTTTTTGCCT 



(2) INFORMATION FOR SEQ ID NO:654: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:654: 
TTTTTGCCTG 



(2) INFORMATION FOR SEQ ID NO:655: 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:656: 
TTTGCCTGTA 



(2) INFORMATION FOR SEQ ID NO:657: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:657: 
TTGCCTGTAC 



(2) INFORMATION FOR SEQ ID NO:658: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:658: 
TGCCTGTACT 



(2) INFORMATION FOR SEQ ID NO:659: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:655: 
TTTTGCCTGT 

(2) INFORMATION FOR SEQ ID NO:656: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:659: 
GCCTGTACTG 

(2) INFORMATION FOR SEQ ID NO:660: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:660: 
CCTGTACTGG 
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(2) INFORMATION FOR SEQ ID NO:661: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:661: 
CTGTACTGGG 

(2) INFORMATION FOR SEQ ID NO:662: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:662: 
TGTACTGGGT 

(2) INFORMATION FOR SEQ ID NO:663: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:663: 
GTACTGGGTC 

(2) INFORMATION FOR SEQ ID NO:664: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:664: 
TACTGGGTCT 

(2) INFORMATION FOR SEQ CD NO:665: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:665: 
ACTGGGTCTC 

(2) INFORMATION FOR SEQ ID NO:666: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:666: 
CTGGGTCTCT 

(2) INFORMATION FOR SEQ ID NO:667: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:667: 
TGGGTCTCTC 

(2) INFORMATION FOR SEQ ID NO:668: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:668: 
GGGTCTCTCT 

(2) INFORMATION FOR SEQ ID NO:669: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:669: 
GGTCTCTCTG 

(2) INFORMATION FOR SEQ ID NO:670: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:670: 
GTCTCTCTGG 

(2) INFORMATION FOR SEQ ID NO:67l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ lDNO:671: 
TCTCTCTGGT 

(2) INFORMATION FOR SEQ ID NO:672: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:672; 
CTCTCTGGTT 

(2) INFORMATION FOR SEQ ID NO:673: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:673: 
TCTCTGGTTA 

(2) INFORMATION FOR SEQ ID NO:674: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:674: 
CTCTGGTTAG 

(2) INFORMATION FOR SEQ ID NO:675; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:675: 
TCTCTGGTTA 

(2) INFORMATION FOR SEQ ID NO:676: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:676: 
CTGGTTAGAC 

(2) INFORMATION FOR SEQ ID NO:677: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:677: 
TGGTTAGACC 

(2) INFORMATION FOR SEQ ID NO:678: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:678: 
GGTTAGACCA 

(2) INFORMATION FOR SEQ ID NO:679: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:679: 
GTTAGACCAG 



(2) INFORMATION FOR SEQ ID NO:680: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:680: 
TTAGACCAGA 



(2) INFORMATION FOR SEQ ID NO:681: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:681: 
TAGACCAGAT 



(2) INFORMATION FOR SEQ ID NO:682: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:682: 
AGACCAGATC 



(2) INFORMATION FOR SEQ ID NO:683: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:683: 
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GACCAGATCT 



(2) INFORMATION FOR SEQ ID NO:684: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:684: 
ACCAGATCTG 



(2) INFORMATION FOR SEQ ID NO:685: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:685: 
CCAGATCTGA 



(2) INFORMATION FOR SEQ ID NO:686: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:686: 
CAGATCTGAG 



(2) INFORMATION FOR SEQ ID NO:687: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(it) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:687: 
AGATCTGAGC 



(2) INFORMATION FOR SEQ ID NO:688: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDKESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:688: 
GATCTGAGCC 



(2) INFORMATION FOR SEQ ID NO:689: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:689: 
ATCTGAGCCT 



(2) INFORMATION FOR SEQ ID NO:690: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:690: 
TCTGAGCCTG 



(2) INFORMATION FOR SEQ ID NO:691: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:69I: 
CTGAGCCTGG 



(2) INFORMATION FOR SEQ ID NO:692: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:692: 
TGAGCCTGGG 
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(2) INFORMATION FOR SEQ ID NO:693: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:693: 
GAGCCTGGGA 



(2) INFORMATION FOR SEQ ID NO:694: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:694: 
AGCCTGGGAG 



(2) INFORMATION FOR SEQ ID NO:695: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ lDNO:695: 
GCCTGGGAGC 



(2) INFORMATION FOR SEQ ID NO:696: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:696: 
CCTGGGAGCT 



(2) INFORMATION FOR SEQ ID NO:697: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:697: 
CTGGGAGCTC 



(2) INFORMATION FOR SEQ ID NO:698: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:698: 
TGGGAGCTCT 



(2) INFORMATION FOR SEQ ID NO:699: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:699: 
GGGAGCTCTC 



(2) INFORMATION FOR SEQ ID NO:700: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:700: 
GGAGCTCTCT 



(2) INFORMATION FOR SEQ ID NO:701: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:701 
GAGCTCTCTG 



(2) INFORMATION FOR SEQ ID NO:702: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:702: 
AGCTCTCTGG 



(2) INFORMATION FOR SEQ ID NO:703: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:703: 
GCTCTCTGGC 



(2) INFORMATION FOR SEQ ID NO:704: ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single -lw 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:704: 
CTCTCTGGCT 



(2) INFORMATION FOR SEQ ID NO:705: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:705: 
TCTCTGGCTA 



(2) INFORMATION FOR SEQ ID NO:706: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:706: 
CTCTGGCTAA 

(2) INFORMATION FOR SEQ ID NO:707: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:707: 
TCTGGCTAAC 

(2) INFORMATION FOR SEQ ID NO:708: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNES S : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:708: 
CTGGCTAACT 

(2) INFORMATION FOR SEQ ID NO:709: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:709: 
TGGCTAACTA 

(2) INFORMATION FOR SEQ ID NO:710: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:710; 
GGCTAACTAG 

(2) INFORMATION FOR SEQ ID NO:71 1: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:711: 
GCTAACTAGG 

(2) INFORMATION FOR SEQ ID NO:712: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:712: 
CTAACTAGGG 

(2) INFORMATION FOR SEQ ID NO:7I3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:713: 
TAACTAGGGA 

(2) INFORMATION FOR SEQ ID NO:714: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:7l4: 
AACTAGGGAA 

(2) INFORMATION FOR SEQ ID NO:7I5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ H)NO:715: 
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ACTAGGGAAC 

(2) INFORMATION FOR SEQ ID NO:716: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:716: 
CTAGGGAACC 

(2) INFORMATION FOR SEQ ID NO:717: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION; SEQ IDNO:717: 
TAGGGAACCC 

(2) INFORMATION FOR SEQ ID NO:7t8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:718: 
AGGGAACCCA 

(2) INFORMATION FOR SEQ ID NO:719: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:719: 
GGGAACCCAC 

(2) INFORMATION FOR SEQ ID NO:720: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:720: 
GGAACCCACT 

(2) INFORMATION FOR SEQ ID NO:72 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:721: 
GAACCCACTG 

(2) INFORMATION FOR SEQ ID NO:722: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ lDNO:722: 

AACCCACTGC 

(2) INFORMATION FOR SEQ ID NO:723: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:723: 
ACCCACTGCT 

(2) INFORMATION FOR SEQ ED NO:724: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:724: 
CCCACTGCTT 
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(2) INFORMATION FOR SEQ ID NO:725: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:725: 
CCACTGCTTA 

(2) INFORMATION FOR SEQ ID NO:726: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:726: 
CACTGCTTAA 

(2) INFORMATION FOR SEQ ID NO: 727: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:727: 
ACTGCTTAAG 

(2) INFORMATION FOR SEQ ID NO:728: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:728: 
CTGCTTAAGC 

(2) INFORMATION FOR SEQ ID NO:729: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:729: 
TGCTTAAGCC 

(2) INFORMATION FOR SEQ ID NO:730: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:730: 
GCTTAAGCCT 

(2) INFORMATION FOR SEQ ID NO:73I: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73l: 
CTTAAGCCTC 

(2) INFORMATION FOR SEQ ID N0.732: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:732: 
TTAAGCCTCA 

(2) INFORMATION FOR SEQ ID NO:733: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:733: 
TAAGCCTCAA 

(2) INFORMATION FOR SEQ ID NO:734: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:734: 
AAGCCTCAAT 

(2) INFORMATION FOR SEQ ID NO:735: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN0 735: 
AGCCTCAATA 

(2) INFORMATION FOR SEQ ID NO: 736. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:736: 
GCCTCAATAA 

(2) INFORMATION FOR SEQ ED NO:737: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:737: 
CCTCAATAAA 

(2) INFORMATION FOR SEQ ID NO:738: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:738: 
CTCAATAAAG 

(2) INFORMATION FOR SEQ ID NO:739: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:739: 
TCAATAAAGC 

(2) INFORMATION FOR SEQ ID NO:740: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ED NO:740: 
CAATAAAGCT 

(2) INFORMATION FOR SEQ ID NO:741: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74I: 
AATAAAGCTT 

(2) INFORMATION FOR SEQ ID NO:742: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:742: 
ATAAAGCTTG 

(2) INFORMATION FOR SEQ ID NO:743: 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 10 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:743: 
TAAAGCTTGC 

(2) INFORMATION FOR SEQ ID NO:744: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:744: 
AAAGCTTGCC 

(2) INFORMATION FOR SEQ ID NO:745: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:745: 
AAGCTTGCCT 

(2) INFORMATION FOR SEQ ED NO:746: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:746: 
AGCTTGCCTT 

(2) INFORMATION FOR SEQ ID NO:747: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:747: 



- 174 - 

GCTTGCCTTG 

(2) INFORMATION FOR StQ ID NO:748: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:748: 
CTTGCCTTGA 

(2) INFORMATION FOR SEQ ID NO:749: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:749: 
TTGCCTTGAG 

(2) INFORMATION FOR SEQ ID NO:750: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:750: 
TGCCTTGAGT 

(2) INFORMATION FOR SEQ ID NO:751: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:751: 
GCCTTGAGTG 

(2) INFORMATION FOR SEQ ID NO:752: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:752: 
CCTTGAGTGC 



(2) INFORMATION FOR SEQ ID NO:753: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:753: 
CTTGAGTGCT 



(2) INFORMATION FOR SEQ ID NO:754: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:754: 
TTGAGTGCTT 



(2) INFORMATION FOR SEQ ID NO:755: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:755: 
TGAGTGCTTC 



(2) INFORMATION FOR SEQ ID NO:756: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:756: 
GAGTGCTTCA 
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(2) INFORMATION FOR SEQ ID NO:757: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:757: 
AGTGCTTCAA 



(2) INFORMATION FOR SEQ ID NO:758: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.758: 
GTGCTTCAAG 



(2) INFORMATION FOR SEQ ID NO:759: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:759; 
TGCTTCAAGT 



(2) INFORMATION FOR SEQ ID NO:760: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:760: 
GCTTCAAGTA 



(2) INFORMATION FOR SEQ ID NO:76l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



WO 95/21912 




(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:761: 
CTTCAACTAG 



(2) INFORMATION FOR SEQ ID NO:762; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:762: 
TTCAAGTAGT 



(2) INFORMATION FOR SEQ ID NO:763: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:763: 
TCAAGTAGTG 



(2) INFORMATION FOR SEQ ID NO: 764: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 764: 
CAAGTAGTGT 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:766: 
AGTAGTGTGT 



(2) INFORMATION FOR SEQ ID NO:767: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:767: 
GTAGTGTGTG 



(2) INFORMATION FOR SEQ ID NO:768: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:768: 
TAGTGTGTGC 



(2) INFORMATION FOR SEQ ID NO:769: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



00 MOLECULE TYPE: DNA 

(2) INFORMATION FOR SEQ ID NO:765: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:769 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs AGTGTGTGCC 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear (2) INFORMATION FOR SEQ ID NO:770: 

(ii) MOLECULE TYPE: DNA (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:765: (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
AAGTAGTGTG (D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(2) INFORMATION FOR SEQ ID NO:766: 
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(xi) SEQUENCE DESCRIPTION: SEQ IDNO:770: 
GTGTGTGCCC 



(2) INFORMATION FOR SEQ ID NO:77l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:771: 
TGTGTGCCCG 



(2) INFORMATION FOR SEQ ID NO:772: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:772: 
GTGTGCCCGT 



(2) INFORMATION FOR SEQ ID NO:773: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:773: 
TGTGCCCGTC 



(2) INFORMATION FOR SEQ ED NO:774: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0.774: 
GTGCCCGTCT 



(2) INFORMATION FOR SEQ ID NO: 775: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:775: 
TGCCCGTCTG 



(2) INFORMATION FOR SEQ ID NO:776: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:776: 
GCCCGTCTGT 



(2) INFORMATION FOR SEQ ID NO:777: - ';. 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs ; * 1 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single >>- 

(D) TOPOLOGY: linear 1 f 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:777: 
CCCGTCTGTT 



(2) INFORMATION FOR SEQ ID NO:778: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:778: 
CCGTCTGTTG 



(2) INFORMATION FOR SEQ ID NO:779: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:779: 
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CGTCTGTTGT 

(2) INFORMATION FOR SEQ ID NO:780: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:780: 
GTCTGTTGTG 

(2) INFORMATION FOR SEQ ID NO:781 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:781: 
TCTGTTGTGT 

(2) INFORMATION FOR SEQ ID NO:782: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:782: 
CTGTTGTGTG 

(2) INFORMATION FOR SEQ ID NO:783: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:783: 
TGTTGTGTGA 

(2) INFORMATION FOR SEQ ID NO:784: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:784: 
GTTGTGTGAC 

(2) INFORMATION FOR SEQ ID NO:785: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:785: 
TTGTGTGACT 

(2) INFORMATION FOR SEQ ID NO:786: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:786: 
TGTGTGACTC 

(2) INFORMATION FOR SEQ ID NO:787: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:787: 
GTGTGACTCT 

(2) INFORMATION FOR SEQ ID NO:788: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:788: 
TGTGTGACTC 
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(2) INFORMATION FOR SEQ ID NO:789: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:789: 
GTGTGACTCT 

(2) INFORMATION FOR SEQ ID NO:790: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790: 
TGTGACTCTG 

(2) INFORMATION FOR SEQ ID NO:791 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:791: 
GTGACTCTGG 

(2) INFORMATION FOR SEQ ID NO:792: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:792: 
TGACTCTGGT 

(2) INFORMATION FOR SEQ ID NO:793: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:793: 
GACTCTGGTA 

(2) INFORMATION FOR SEQ ID NO:794: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:794: 
ACTCTGGTAA 

(2) INFORMATION FOR SEQ ID NO: 795: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:795: 
CTCTGGTAAC 

(2) INFORMATION FOR SEQ ID NO:796: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:796: 
TCTGGTAACT 

(2) INFORMATION FOR SEQ ID NO:797: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:797: 
CTGGTAACTA 



WO 95/21912 



PCT/AU95/00063 



- 180 - 



(2) INFORMATION FOR SEQ ID NO:798: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:798: 
TGGTAACTAG 



(2) INFORMATION FOR SEQ ID NO:799: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:799: 
GGTAACTAGA 
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(2) INFORMATION FOR SEQ ID NO: 800: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800: 

TGGAAGGGCT AATTCACTCA CGGAAAAGAC CAGTTGAACC AGAAGAAGAT AGAAGAGGCC 6 0 

ATGAAGAAGA AAACAACAGA TTGTTCTGCT TGCTCAGCTG GGGACTTTCC AGAAGGCGCG 12 0 

GCCTGAGTGA CTAAGCCCCG TTGGGGACTT TCCGAAGAGG CATGAAGGGA CTTTCCAAGG 180 

CAGGCGTGGC CTGGGCGGGA CTGGGGAGTG GCGAGCCCTC AGATGCTGCA TATAAGCAGC 24 0 

TGCTTTCTGC CTGTACTGGG TCTCTCTGGT TAGACCAGAT CTGAGCCTGG 'GAGCTCTCTG 300 

GCTAGCTAGG GAACCCACTG CTTAAGCCTC AATAAAGCTT GCCTTGAGTG CTTCAAGTAG 36 0 

TGTGTGCCCG TCTGTTGTGT GACT CTGGT A TCTAGAGATC CCTCAGACCA TTTTAGTCCG 42 0 

TGTGGAAAAT CTCTAGCAGT GGCGCCCGAA CAGGGACTTG AAAGCGAAAG GAAAACCAGA 48 0 

GGAGCTCTCT CGACGCAGGA CTCGGCTTGC TGAAGCGCGC ACGGCAAGAG GCGAGGGGCG 54 0 

GCGACTGGTG AGTACG CCG A AAATTTTGAC TAGCGGAGGC TAGAAGGAGA GAGATGGGTG 6 00 

CGAGAGCGTC AATATTAAGC GGGGG AAAAT TAGATAGATG GGAGAAAATT CGGTTAAGGC 660 

CAGGAGGAAA GAAAAAGTAT AAATTAAAAC ATATAGTATG GGCAAGCAGG GAGCTAGAAC 720 

GATTCGCAGT CAATCCTGGC CTGTTGGAAA CATCAGAAGG CTGTAGACAA ATACTGGGAC 780 

AGTTACACCC GTCCCTTCAG ACAGG AT CAG AAGAACTTAA ATCAGTATAT AATGCAGTAG 84 0 

CAGTCCTCTA TTGTGTGCAT CAAAACATAG ACATAAAGGA CACCAAGGAA GCTTTAGAAA 900 

AGATAGAGGA AGAGCAAAAC AAATGTAAGA AAAAAGCACA GCAAGCAGCA GCACAGCAAG 960 

CAGCAGCTGG CACAGGAAAC AGCAACCCGG TCAGCCAAAA TTAC CCTATA GTACAGAACA 102 0 

TGCAGGGGCA AATGGTACAT CAGG C CAT AT CACCTAGAAC TTTAAATGCA TGGGTAAAAG 1080 

TAATAGAAGA GAAGGCTTTC AGCCCAGAGG TAATACCCAT GTTTTCAGCA TTATCAGAAG 1140 

GAGCCACCCC ACAAGATTTA AACACCATGC TAAACACAGT GGGGGGACAT CAAGCAGCTA 12 00 

TGCAAATGTT AAAAGAGACC AT CAATG AGG AAGCTGCAGA ATGGGATAGA TTACATCCAG 12 60 

CGCAGGCAGG GCCTGTTGCA C CAGG C CAG A TGAGAG AC C C AAGGGGAAGT GACATAGCAG 132 0 

GAACTACTAG TACCCTTCAG GAACAAATAG GATGGATGAC AGGTAATCCA GCTATCCCAG 13 80 

TAGGAGAAAT CTATAAAAGA TGGATAATCC TGGGATTAAA TAAAATAGTA AGGATGTATA 144 0 

GCCCTATCAG CATTCTGGAC ATAAAACAAG G AC CAAAGG A ACCCTTTAGA GACTATGTAG 150 0 

AC CGGTTCTA TAAAACTCTA AGAG C CGAGC AAGCTACACA GGAGGTAAAA AATTGGATGA 1560 

CAGAAACCTT GTTGGTCCAA AATGCAAACC CAGATTGTAA GACTATTTTA AAAGCATTGG 1620 
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GACCAGCAGC TACACTAGAA GAAATGATGA CAGCATGTCA GGGAGTGGGA GGACCCAGCC 168 0 

ATAAAGCAAG AG TTTTGG C A GAAGCAATGA GCCAAGCAAC AAATGCAGCT ACTGTAATGA 174 0 

TGCAGAGAAG CAATTTTAGA AACCAAAGAA AGAATGTTAA GTGTTTCAAT TGTGGCAAAG 1800 

AAGGGCACAT AG CCAGAAAT TGCAGGGCTC CTAGGAAAAG GGGCTGTTGG AAATGTGGAA 1860 

AGGAAGGACA CCAAATGAAA GATTGTACTG AG AG ACAGG C TAATTTTTTA GGGAAAATCT 192 0 

GGCCTTCCCA CAAGGGGAGG CCAGGGAACT TTCTTCAGAG C AGG.C CAGAA CCAACAGCCC 198 0 

CTCTCCAGGG CAGGCCGGAG CCATCAGCCC CGCCAGAAGA GAGCTTCAGG TTTGGGGAGG 2 04 0 

AGACAACAAC TCCCTCTCAG AAGCAGGAGC CGATAGACAG GGACAGGGAT CTGTATCCTT 2100 

TAGCTTCCCT CAGATCACTC TTTGGCAACG ACCCCTCGTC ACAATAAAGA TAGGGGGGCA 2160 

G CTG AAGGAA GCTCTATTAG ATACAGGAGC AGATGATACA GTATTAGAAG ACATGCATTT 222 0 

GCCAGGAAAA TGGAAACCAA AAATGATAGG GGGAATTGGA GGTTTTATCA AAGTAAAACA 22 8 0 

ATATGATGAA ATTCTTGTAG AAATCTGTGG ACATAAAGCT ATAGGTACAG TATTAGTAGG 234 0 

ACCTACACCT GTCAACATAA TTGGAAGAAA TCTGTTGACT CAGATTGGTT GCACTTTAAA 24 0 0 

TTTTCCCATT AGTCCTATTG AAACTGTACC AGTACAATTA AAGCCAGGAA TGGATGGCCC 2460 

AAAGGTTAAA CAATGGCCAT TGACAGAAGA GAAAATAAAA GCATTAGTAG AAATTTGTAC 252 0 

AGAAATGGAA AAGG AAGGAA AGATTTCAAA AATTGGGCCT GAAAATCCAT ACAATACTCC 2 58 0 

AGTATTTGCC ATAAAGAAAA AAGATGGTAC TAAATGGAGA AAATTAG TAG ATTTCAGAGA 264 0 

CCTTAATAAG AGAACTCAAG ACTTCTGGGA AGTTCAATTA GGAATACCAC ATCCCTCAGG 2700 

ATTAAAAAAG AAAAAATCAG TAACAGTACT GGATGTGGGT GATGCATACT TTTCAGTTCC 2760 

CTTAGATGAA AACTTCAGGA AGTATACTGC ATTTACCATA CCTAGTATAA ATAATGAGAC 282 0 

ACCAGGGATT AG ATAT CAG T ACAATGTGCT TCCACAGGGA TGGAAAGGAT CACCAGCAAT 288 0 

ATTCCAAAGT AGCATGACAA GAATCTTAGA GCCTTTTAGA AGACAAAATC CAGACATAGT 294 0 

TATCTATCAA TACATGGATG ACTTGTATGT AGGATCTGAT TTAGAAATAG GACAGCATAG 3 0 00 

AATAAAAATA GAGGAACTGA GACAACATCT GTTGAAGTGG GG ATT T AC CA CACCAGACAA 3 060 

AAAGCATCAG AAAGAAC CCC CATTCCTTTG GATGGGTTAT GAACTCCATC CTGATAAATG 312 0 

GACAGTGCAA CCTATAGTAC TGCCAGAAAA AGACAGCTGG ACTGTCAATG ACATACAGAA 318 0 

GTTAGTGGGT AAATTAAATT GGGCAAGTCA G ATTTACC CA GGAATTAAAG T AAGG CAATT 324 0 

ATGTAAACTC CTTAGGGGAA CCAAAGCACT AACAGAAGTA ATACCACTAA CAGAAGAAGC 3300 

AGAGCTAGAA C T GG CAG AAA ACAGGGAAAT TCTAAGAGAA CCAGTACATG GAGTGTATTA 3360 

TGACCCATCA AAAGACTTAA TAGCAGAAAT AC AG AAG CAG GAGCAAGGCC AATGGACATA 34 20 

TCAAATTTAT CAAGATCAAT TTAAAAATCT AAAAACAGGA AAG TATGCAA GATTGAGGGG 34 8 0 

TGCCCACACT AATGATGTAA AACAATTTCC AGAGGCAGTG CAAAAAATAG CCACAGAAAG 354 0 

CATAGTAATA TGGGGAAAGA CTCCTAAATT TAGACTACCC ATACAAAAAG AAACATGGGA 36 00 

CGCATGGTGG ACAGAGTATT GG CAAGCCAC CTGGATTCCT GAGTGGGAGT TTGTCAATAC 3660 
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CCCTCGCCTA 


GTAAAATTAT GGTACCAGTT AGAAAAAGAA CCCATAATAG GAGCAGAAAC 


3720 


TTTCTATGTA 


GATGGGGCAG CTAACAGAGA GACTAAATTA GGAAAAGCAG GATATGTTAC 


3780 


TGACAGAGGA 


AGACAAAAAG TTGTCTCCCT AACTGACACA ACAAATCAGA AGACTGAGTT 


3840 


ACAAGCAATT 


CATCTAGCTT TGCAGGATTC AGGATTAGAA GTAAACATAG TAACAGACTC 


3900 


ACAGTATGCA 


TTAGGAATCA TTCAAGCACA AC CAG AT AAA AGTGAATCAG AAATAGTCAA 


3960 


TCAAATAATA 


GAG CAATTAA TAAAAAAGGA AAAGGTC TAC CTGGCATGGG TACCAGCACA 


4020 


CAAAGGAATT 


GGAGGGAATG AACAAGTAGA TAAATTAGTC AGTGCTGGAA TCAGGAAAAT 


4080 


ACTATTTTTA 


GATGGAATAG ATAAGGCACA AGAAGGCCAT GAGAAATATC ACAGTAATTG 


4140 


GAG AG CAATG 


GCTAGTGGTT TTAACCTGCC ACCTATAGTA GCAAAAGAAA TAGTAGCCAG 


4200 


CTGTGATAAA 


TGTCAGCTAA AAGGAGAAGC CATGCATGGA CAAGTAGACT GTAGTCCAGG 


4260 


AATATGGCAA 


CTAGATTGTA CACATCTAGA AGGAAAAATT ATCCTGGTAG CAGTTCATGT 


4320 


AGCCAGTGGA 


TATATAGAAG CAGAAGTTAT TCCAGCAGAG ACAGGGCAGG AAAC AG CAT A 


4380 


CTTTATCTTA 


AAATTAGCAG GAAGGTGGCC AGTAAACACA ATACATACAG ACAATGGCGG 


4440 


CAATTTCATC 


AGTACCACGG TTAAGGCCGC CTGTT GGTGG GCAGGGATCA AGCAGGAATT 


4500 


TGGCATTCCC 


TACAATCCCC AAAGCCAAGG AGTAGTGGAA TCTATGAATA GAGAATTAAA 


4560 


GAAAATTATA 


GGACAGGTAA GAGAT CAGGC TGAACATCTT AAGACAGCAG TACAAATGGC 


4620 


AGTATTCATC 


CACAATTTTA AAAGAAAAGG GGGGATTGGG GGATACAGTG CAGGGGAAAG 


4680 


AATAGTAGAC 


ATAATAGCAA CAGACATACA AACTAAAGAA TTACAAAAGC AAATTACAAA 


4740 


AATTCAAAAT 


TTTCGGGTTT ATTACAGGGA CAGCAGAGAT CCACTTTGGA AAGGAC CAGC 


4800 


AAAACTTCTC 


TGGAAAGGCG AAGGGGCAGT AGTAATACAA GATAATAGTG ACATAAAAGT 


4860 


AGTGCCAAGA 


AGAAAAGTAA AGATCATTAG GGATTATGGA AAACAGATGG CAGGTGATGA 


4920 


TTGTGTGGCA 


AGTAGACAGG ATGAGGATTA GAACATGGAA CAGTTTAGTG AAACACCATA 


4980 


TGTATGTTTC 


AAAGAAAGCT AAGGGATGGA TTTATAGACA TCACTATGAA AACACTCATC 


5040 


CAAAAATAAG 


CTCAGAAGTA CACATCCCAC TAGGGGAAGC T AG ATTGGT A ATAACAACAT 


5100 


ATTGGGGTCT 


ACATACAGGA GAAAGAG AC T GGCATTTGGG TCAGGGAGTC TCCATAGAAT 


5160 


GGAGGGAAAG 


GACATATAGA ACACAAGTAG ACCCCGAACT AGCAGACCAA CTAATTCATA 


5220 


TACATTACTT 


TGATTGTTTT TCAGAATCTG CCATAAGAAG TGCCATATTA GGATATAGAG 


5280 


TTAGGCATAG 


GTGTGAATAT CAAGCAGGAC ATAACAAGGT AGGATCTCTA CAGTACTTGG 


5340 


CACTAACAGC 


ATTAATAACA CCAAAGAAGA TAAAGCCACC TTTGCCTAGT GTTGCGAAAC 


5400 


TGACAGAGGA 


TAGATGGAAC AAGCCCCAGA AG AC CAAGGG CCACAGAGGC AGCCATACAA 


5460 


TGAATGGACA 


CTAGAACTTT TAGAGGAGCT TAAGAATGAA GCTGTTAGGC ATTTTCCTAG 


5520 


GGTATGGCTC 


CATGGCTTAG GG CAA CAT AT CTATGAAACT TATGGGGATA CTTGGGAAGG 


5580 


AGTGGAGGCC 


ATAACAAGAA CTCTGCAACA ACTGCTGTTT ATTCATTTCA GAATTGGGTG 


5640 


TCAACATAGC 


AGAATAGGCA TTATTCGACA G AGG AG AG CA AGAAATGGAG CCAGTAGATC 


5700 
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CTAGACTAGA GCCCTGGAAG CATCCAGGAA GTCAGCCTAA GACTGCGTGT ACCACTTGCT 5760 

ATTGTAAAAA GTGCTGCTTT CATTGCCAAG TTTGTTTTAT GACAAAAGGC TTAGGCATCT 5820 

CCTATGGCAG GAAGAAGCGG AGACAGCGAC GAAGAGCTCC TCAAGACAGT CAGACTCATC 5880 

AAGCTTATCT ATCAAAGCAG TAAGTAATAT ATGTAATGCA ACCTTTACAA ATAGTAGCAA 5940 

TAGT AG CATT AGTAGTAGCA GGAATAATAG CAATAGTTGT GTGGAC CATA GTATT CATAG 6 000 

AATATAAGAA AATATTAAGA CAAAGAAAAA TAGACAGGTT GATTGATAGA ATAAGAGAAA 6060 

GAGCAGAAGA CAGTGGCAAT GACAGTGAAG GGGATCAGGA AGAATTATCG GCACTTGTGG 612 0 

ACATGGGGCA C CATGATC CT TGGGATATTA ATGATCTGTA GAGCTGCAAA CAATTTGTGG 618 0 

GTCACAGTCT ATTATGGGGT ACCTGTGTGG AGAGAAGCAA CCACCACTCT ATTTTGTGCA 624 0 

TCAGATGCCA AGG CAT ATG A TGCAGAGGTA CATAATGTTT GGGCCACACA TGCCTGTGTA 63 00 

CCCACAGACC CTAACCCACA AGAAGTAGAA TTGAAAAATG TGACAGAAAA TTTTAACATG 6 36 0 

TGGAAAAATA ACATGGTAGA ACAGATG CAT GAGGATATAA TCAGTTTATG GGATCAAAGC 642 0 

CTGAAGCCAT GTGTAAAATT AACCCCACTC TGTGTTTCTT TAAATTGCAC TGATGCTACT 64 8 0 

AATACCACTA ATAGTAATAC CACTAGCAGC AGCGAGAAAC CGAAGGGGAC AGGGGAAATA 654 0 

AAAAACTGCT CTTTCAATAT CACCACAAGC ATAAGAGATA AGGTGCAGAA ACAATATGCA 6600 

CTTTT TT AT A GCCTTGATGT AGTACCAATG GATGATAATG ATAATAGTAC AAGCTATAGG 666 0 

TTAATAAGTT GTAACACCTC AATCATTACA CAGGCCTGTC CAAAGATATC CTTTGAGCCA 672 0 

ATTCCCATAC ATTATTGTGC CCCGGCTGGT TTTGCGATTC TAAAGTGTAA AGATAAAAGG 6 780 

TTCAATGGAA AAGGACCATG TACAAGTGTC AGCACAGTAC AGTGTACACA TGGAATTAGG 684 0 

CCAGTAGTAT CAACTCAACT GTTGTTAAAT GGCAGTCTAG CAGAAGAAGA GGTAGTAATT 6900 

AGATCTGACA ATTTTACGAA CAATG CTAAA ACCATAATAG TACAGCTGAG CAAATCTGTA 6960 

GAAATTACTT GTGTAAGACC CAACAACAAT ACAAGAAAAA GTATAAGTAT GGGACCAGGG 7020 

AGAGCATTTT ATACAACAGA AATAATAGGA GATATAAGAC AAGCATATTG TAACATTAGT 7080 

AAAGCAAACT GGACTGACAC TTTAGAACAG ATAGCTAGAA AATTAAGAGA ACAATTTGAG 7140 

AATAAAACAA TAGTCTTTAA GCCATCCTCA GGAGGGGACC CAGAAATTGT AACACAGTTT 72 00 

TACAGTTTTA ATTGTGGAGG GGAATTTTTC TACTGTAATT CAACACAACT GTTTAATGGT 72 60 

ACTTGGAATG GTACTTGGGT TAATGGTACT TGGAGTAGTA ATAATACGAC TGATACTGCA 7320 

AATATCACAC TCCCATGCAG AATAAAACAA TTTATAAACA TGTGGCAGGA AGTAGGAAAA 73 80 

GCAATGTATG CCCCTCCCAT CAAAGGACAA ATTAAATGTA CATCAAATAT TACAGGGCTG 744 0 

ATATTAACAA GAGATGGTGG TAACAATAAC ACCACGAACG ACAACGAGAC CGAGACCTTC 750 0 

AGAC CTGG AG GAGGAGATAT GAGGGACAAT TGGAGAAGTG AATTATATAA ATATAAAGTA 756 0 

GTACAAGTTG AACCATTAGG AGTAGCACCC ACCAAGGCAA AGAGAAGAGT GGTG CAAAG A 7620 

GAAAAAAGAG CAGTGGGAAT AGGAG CTATG TTCCTTGGGT TCTTAGGAGC AGCAGGAAGC 7680 

ACTATGGGCG CAGCGTCAGT GACGCTGACG GTACAAGCCA GACAATTATT GTCTGGTATA 7740 



BNSDOCID: <WO 9521912A1 J_> 




WO 95/21912 W W PCT/AU95/00063 

-185- 

GTGCAGCAGC AGAACAATCT GCTGAGGGCT ATTGAGGCGC AACAGCATCT GTTGCAACTC 7 800 

ACAGTCTGGG G CATCAAAC A GCTCCAGGCA AGAGTCCTGG CTGTGGAAAG ATACCTAAGG 7860 

GATCAACAGC TCCTGGGACT TTGGGGTTGC TCTGGAAAAC TCATTTGCAC CACTACTGTG 7 92 0 

CCTTGGAACA ATAG CTGGAG TAATAAATCT CTGGAAACAA TTTGGGATAA CATGACCTGG 7 980 

ATGCAGTGGG AAAGAGAAAT TGACAATTAC ACAAACATAA TATACACCTT AATTGAAGAA 8 04 0 

TCGCAGAACC AACAAGAAAA AAATGAACTA GAATTATTGG AATTGGATAA ATGGGCAAAT 8100 

TTGTGGAATT GGTTTAGTAT ATCAAACTGG CTATGGTATA TAAAATTATT CATAATGGTA 8160 

GTAGGAGGCT TGGTAGGTTT AAGAATAGTT TTTACTGTAC TTTCTATAGT TAATAGAGTT 8220 

AGGCAGGGAT ACTCACCATT ATCGTTTCAG ACCCACCTCC CAACCCCGAA GGGACC CGAC 8280 

AGGCCAGAAG GAATCGAAGA AGAAGGTGGA GAGAGAGACA GAGGCAGCTC CACTCGATTA 8340 

GTGCACGGAT TCTTAGCACT TTTCTGGGAC GACCTGAGGA GTCTGTGCCT CTTCAGCTAC 84 00 

CACCACTTGA GAGACTTACT CTTGATTGTA ACGAGGATTG TGGAACTTCT GGGACGCAGG 8460 

GGATGGGAAG CCCTCAAATA CTGGTGGAAT CTCCTGCAGT ATTGGAGGCA GGAACTACAG 852 0 

AAGAGTGCTG TTAGCTTGTT CAATGGCACG GCCATAGCAG TAG CTGAGGG GACAGATAGA 8S80 

GTTATAGAAG CTTTACGAAG GGCTTATAGA GCTATTCTCC ACATACCTAG AAGAATAAGA 8 64 0 

CAGGGCTTAG AAAGGGCTTT GCTATAAAAT GGGTGGCAAG TGGTCAGAAA GTAGTGTGGT 87 00 

TAGAAGGCAT GTACCTTTAA C-ACAAGGCAG CTATAGATCT TAGCCGCTTT TTAAAAGAAA 8760 

AGGGGGGACT GGAAGGGCTA ATTCACTCAC GGAAAAGACC AGTTGAACCA GAAGAAGATA 8820 

GAAGAGGCCA TGAAGAAGAA AACAACAGAT TGTTCTGCTT GCTCAGCTGG GGACTTTCCA 8880 

GAAGGCGCGG CCTGAGTGAC TAAGCCCCGT TGGGGACTTT CCGAAGAGGC ATGAAGGGAC 8 94 0 

TTTCCAAGGC AGGCGTGGCC TGGGCGGGAC TGGGGAGTGG CGAGCCCTCA GATGCTGCAT 9000 

ATAAGCAGCT GCTTTCTGCC TGTACTGGGT CTCTCTGGTT AGAC CAGATC TGAGCCTGGG 9060 

AGCTCTCTGG CTAGCTAGGG AACCCACTGC TTAAGCCTCA ATAAAGCTTG CCTTGAGTGC 912 0 

TTCAAGTAGT GTGTGCCCGT CTGTTGTGTG ACTC TGGTAT CTAGAGATCC CTCAGACCAT 9180 

TTTAGTCCGT GTGGAAAATC TCTAGCA 92 07 
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CLAIMS: 

1. An isolated HIV-1 strain or a component, part, fragment or derivative thereof 
which is substantially non-pathogenic. 

2. A strain of HIV-1 according to claim 1 wherein said strain is capable of 
stimulating an immune response in a human or primate against at least one glycoprotein 
on HIV-1 whilst not substantially reducing proliferative responses and cytokine 
production to a mitogen in said human or primate compared to a healthy, non-infected 
human or primate subject. 

3. A strain of HIV-1 according to claim 2 which carries one or more mutations in 
its genome resulting in the inability to direct synthesis of at least one polypeptide or 
protein from a pathogenic strain of HIV-1 or results in reduced synthesis of said 
polypeptide or protein. 

4. A strain of HIV-1 according to claim 2 which carries one or more mutations in 
its genome resulting in the said genome directing synthesis of a truncated form of a 
polypeptide or protein from a pathogenic strain of HIV-1. 

5. A strain of HIV-1 according to claim 3 or 4 wherein said strain carries a 
mutation in its nef gene and/or long terminal repeat (LTR) region or in a functionally 
equivalent location in the HIV-1 genome. 

6. A strain of HIV-1 according to claim 5 wherein said strain is reactive to 
antibodies to gp41-45, gpl20 and/or gpl60 of HIV-1 and carries a deletion of at least 
ten contiguous nucleotides in a region corresponding to the nef gene and/or LTR region 
of pathogenic HIV- 1 . 

7. A strain of HIV- 1 according to claim 6 wherein said strain is capable of inducing 
an immune response to at least one of gag, pol and/or env. 
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8. A strain of HIV-1 according to claim 7 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides within the region from nucleotide 8787 to 
nucleotide 9709 using the nucleotide numbering of HIV-1 strain NL4-3. 

9. A strain of HIV-1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting of: 

nucleotide (i) 8830-8862; 

(ii) 9009-9035; 

(iii) 9019-9029; and 

(iv) 9033-9049. 

10. A strain of HIV-1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting of: 

nucleotide (v) 928 1-9371; 

(vi) 9281-9362; 

(vii) 9105-9224; and 

(viii) 9271-9370. 

11. A strain of HIV-1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting of: 

nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 



12. A strain of HIV-1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting of: 
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nucleotide (xiii) 9105-9224 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

13. A strain of HIV-1 according to claim 8 wherein said deletion encompasses at 
least one of the following decanucleotides from the nef gene of HIV- 1 strain NL4-3: 





(SEQ 


ID 


NO 


:2) ; 


TGGGTGGCAA 


(SEQ 


ID NO 


:3) 


r 




(SEQ 


ID 


NO 


:4) ; 


GGTGGCAAGT 


(SEQ 


ID NO 


:5) , 






(SEQ 


ID 


NO 


: 6) ; 


TGGCAAGTGG 


(SEQ ID 


NO: 7) 


CjGC-AACj 1 bGT 


(SEQ 


ID 


NO 


: 8) ; 


GCAAGTGGTC 


(SEQ 


ID NO 


■ 9) , 




CAAGTGGTCA 


(SEQ 


ID 


NO 


: 10) ; 


AAGTGGTCAA 


(SEQ 


ID 


NO : 


ID 


AGTGGTCAAA 


(SEQ 


ID 


NO 


:12) ; 


GTGGTCAAAA 


(SEQ 


ID 


NO: 


13) 


TGGTGAAAAA 


(SEQ 


ID 


NO 


: 14 ) ; 


GGTCAAAAAG 


(SEQ 


ID 


NO: 


15) 


GTCAAAAAGT 


(SEQ 


ID 


NO 


:16) ; 


TCAAAAAGTA 


(SEQ 


ID 


NO: 


17) 


CAAAAAGTAG 


(SEQ 


ID 


NO 


:18) ; 


AAAAAGTAGT 


(SEQ 


ID NO 


:19) 




AAAAGTAGTG 


(SEQ 


ID 


NO 


.20) ; 


AAAGTAGTGT 


(SEQ 


ID 


NO: 


21) 


AAGTAGTGTG 


(SEQ 


ID 


NO 


22) ; 


AGTAGTGTGA 


/ /inn 

(SEQ 


ID 


NO : 


23) 


GTAGTGTGAT 


(SEQ 


ID 


NO: 


24) ; 


TAGTGTGATT 


(SEQ 


ID NO 


: 25) 




AGTGTGATTG 


(SEQ 


ID 


NO: 


26) ; 


GTGTGATTGG 


(SEQ 


ID 


NO: 


27) 


I vj I bA TTGGA 


(SEQ 


ID 


NO : 


28) ; 


GTGATTGGAT 


(SEQ 


ID 


NO : 


29) 


ICjAI ICjCjATG 


(SEQ 


ID 


NO : 


30) ; 


GATTGGATGG 


/ o'C , p^ 


ID 


IT/TV 

NO : 


31) 




(SEQ 


ID 


NO : 


32) ; 


TTGGATGGCC 


(SEQ 


ID 


NO : 


33) 


TPJIPJ A TP P 1 pr"p 


(SEQ 


ID 


NO : 


34) ; 


GGATGGCCTG 


(SEQ 


ID 


NO: 


35) 


A TCld C PTP P 


i bay 


XJJ 


IMU : 


jdJ ; 


ATGGCCTGCT 


(SEQ 


ID 


NO: 


37) 


TPPPPTinPTip 

1 CjGUCTGCTG 


(SEQ 


ID 


NO : 


38) ; 


GGCCTGCTGT 


(SEQ 


ID 


NO: 


39) 


gcctgctgta 


(SEQ 


ID 


NO: 


40) ; 


CCTGCTGTAA 


(SEQ 


ID 


NO: 


41) 


ctgctgtaag 


(SEQ 


ID 


NO: 


42) ; 


TGCTGTAAGG 


(SEQ 


ID 


NO: 


43) 


gctgtaaggg 


(SEQ 


ID 


NO: 


44) ; 


CTGTAAGGGA 


(SEQ 


ID 


NO: 


45) 


tgtaagggaa 


(SEQ 


ID 


NO: 


46) ; 


GTAAGGGAAA 


(SEQ 


ID 


NO: 


47) 


taagggaaag 


(SEQ 


ID 


NO: 


48) ; 


AAGGGAAAGA 


(SEQ 


ID 


NO: 


49) 


agggaaagaa 


(SEQ 


ID 


NO: 


50) ; 


GGGAAAGAAT 


(SEQ 


ID 


NO: 


51) 


ggaaagaatg 


(SEQ 


ID 


NO: 


52) ; 


GAAAGAATGA 


(SEQ 


ID 


NO: 


53) 


aaagaatgag 


(SEQ 


ID 


NO: 


54) ; 


AAGAATGAGA 


(SEQ 


ID 


NO: 


55) 


agaatgagac 


(SEQ 


ID 


NO: 


56) ; 


GAATGAGACG 


(SEQ 


ID 


NO: 


57) 


aatgagacga 


(SEQ 


ID 


NO: 


58) ; 


ATGAGACGAG 


(SEQ 


ID 


NO: 


59) 


tgagacgagc 


(SEQ 


ID 


NO: 


60) ; 


GAGACGAGCT 


(SEQ 


ID 


NO: 


61) 
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AGACGAGCTG 


(SEQ 


ID 


NO: 


62) ; 


GACGAG CTGA 


(SEQ 


ID 


NO: 63) ; 


ACGAGCTGAG 


(SEQ 


ID 


NO: 


64) ; 


CGAGCTGAGC 


(SEQ 


ID 


NO: 65) ; 


GAGCTGAGCC 


(SEQ 


ID 


NO: 


66) ; 


AGCTGAGCCA 


(SEQ 


ID 


NO: 67) ; 


GCTGAGCCAG 


(SEQ 


ID 


NO: 


68) ; 


CTGAGCCAGC 


(SEQ 


ID 


NO : 6 9 ) ; 


TGAGCCAGCA 


(SEQ 


ID 


NO: 


70) ; 


GAGCCAGCAG 


(SEQ 


ID 


NO: 71) ; 


AGCCAGCAGC 


(SEQ 


ID 


NO: 


72) ; 


GCCAGCAGCA 


(SEQ 


ID 


NO: 73) ; 


CCAGCAGCAG 


(SEQ 


ID 


NO: 


74) ; 


CAGCAGCAGA 


(SEQ 


ID 


NO: 75) ; 


AGCAGCAGAT 


(SEQ 


ID 


NO: 


76) ; 


GCAGCAGATG 


(SEQ 


ID 


NO:77) ; 


CAGCAGATGG 


(SEQ 


ID 


NO: 


78) ; 


AGCAGATGGG 


(SEQ 


ID 


NO: 79) ; 


GCAGATGGGG 


(SEQ 


ID 


NO: 


80) ; 


CAGATGGGGT 


(SEQ 


ID 


NO: 81) , 


AGATGGGGTG 


(SEQ 


ID 


NO: 


82) ; 


GATGGGGTGG 


(SEQ 


ID 


NO: 83) ; 


ATGGGGTGGG 


(SEQ 


ID 


NO: 


84) ; 


TGGGGTGGGA 


(SEQ 


ID 


NO: 85) , 


GGGGTGGGAG 


(SEQ 


ID 


NO: 


86) ; 


GGGTGGGAGC 


(SEQ 


ID 


NO: 87) , 


GGTGGGAGCA 


(SEQ 


ID 


NO: 


88) ; 


GTGGGAGCAG 


(SEQ 


ID 


NO: 89) , 


TGGGAGCAGT 


(SEQ 


ID 


NO: 


90) ; 


GGGAGCAGTA 


(SEQ 


ID 


NO: 91) , 


.GGAGCAGTAT 


(SEQ 


ID 


NO: 


92) ; 


GAGCAGTATC 


(SEQ 


ID 


NO: 93) , 


AGCAGTATCT 


(SEQ 


ID 


NO: 


94) ; 


GCAGTATCTC 


(SEQ 


ID 


NO: 95) , 


CAGTATCTCG 


(SEQ 


ID 


NO: 


96) ; 


AGTATCTCGA 


(SEQ 


ID 


NO: 97) 


GTATCTCGAG 


(SEQ 


ID 


NO: 


98) ; 


TATCTCGAGA 


(SEQ 


ID 


NO: 99) 


ATCTCGAGAC 


(SEQ 


ID 


NO: 


100) ; 


TCTCGAGACC 


(SEQ 


ID NO:101) 


CTCGAGACCT 


(SEQ 


ID 


NO: 


102) ; 


TCGAGACCTA 


(SEQ 


ID NO 


:103) ; 


CGAGACCTAG 


(SEQ 


ID 


NO 


104) ; 


GAGACCTAGA 


(SEQ 


ID NO 


:105) ; 


AGACCTAGAA 


(SEQ 


ID 


NO 


.106) ; 


GACCTAGAAA 


(SEQ 


ID NO 


:107) ; 


AC CTAGAAAA 


(SEQ 


ID 


NO 


: 108) ; 


CCTAGAAAAA 


(SEQ 


ID NO 


:109) ; 


CTAGAAAAAC 


(SEQ 


ID 


NO 


:110) ; 


TAGAAAAACA 


(SEQ 


ID NO 


:111) ; 


AGAAAAACAT 


(SEQ 


ID 


NO 


:112) ; 


GAAAAACATG 


(SEQ 


ID NO 


:113) ; 


AAAAACATGG 


(SEQ 


ID 


NO 


:114) ; 


AAAACATGGA 


(SEQ 


ID NO 


:115) ; 


AAACATGGAG 


(SEQ 


ID 


NO 


:116) ; 


AACATGGAGC 


(SEQ 


ID NO 


:117) ; 


ACATGGAGCA 


(SEQ 


ID 


NO 


:118) ; 


CATGGAGCAA 


(SEQ 


ID NO 


:119) ; 


ATGGAG CAAT 


(SEQ 


ID 


NO 


:120) ; 


TGGAGCAATC 


(SEQ 


ID NO 


:121) ; 


GG AGCAATCA 


(SEQ 


ID 


NO 


:122) ; 


GAGCAATCAC 


(SEQ 


ID NO 


-.123) ; 


AG CAAT C AC A 


• (SEQ 


ID 


NO 


:124) ; 


GCAATCACAA 


(SEQ 


ID NO 


:125) ; 


PA ATP ATA AG 


(SEO 


ID 


NO 


:126) ; 


AATCACAAGT 


(SEQ 


ID NO 


:127) ; 


ATCACAAGTA 


(SEQ 


ID 


NO 


:128) ; 


TCACAAGTAG 


(SEQ 


ID NO 


:12 9) ; 


CACAAGTAGC 


(SEQ 


ID 


NO 


:13 0) ; 


ACAAGTAGCA 


(SEQ 


ID NO 


:131) ; 


CAAGTAGCAA 


(SEQ 


ID 


NO 


:132) ; 


AAGTAGCAAT 


(SEQ 


ID NO 


:133) ; 


AGTAGCAATA 


(SEQ 


ID 


NO 


:134) ; 


GTAGCAATAC 


(SEQ 


ID NO 


:135) ; 


TAGCAATACA 


(SEQ 


ID 


NO 


:136) ; 


AGCAATACAG 


(SEQ 


ID NC 


»:137) ; 
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GCAATACAGC (SEQ ID NO : 13 8 ) ; 

AATACAGCAG (SEQ ID NO : 14 0 ) ; 

T AC AG C AG CT (SEQ ID NO: 142); 

CAGCAGCTAA (SEQ ID NO: 144); 

GCAGCTAACA (SEQ ID NO: 146) ; 

AGCTAACAAT ( SEQ ID NO : 14 8 ) ; 

CTAACAATGC (SEQ ID NO:150); 

AACAATGCTG (SEQ ID NO: 152) ; 

CAATGCTGCT (SEQ ID NO:154); 

ATGCTGCTTG (SEQ ID NO: 156); 

GCTGCTTGTG (SEQ ID NO:158); 

TGCTTGTGCC (SEQ ID NO: 160); 

CTTGTGCCTG (SEQ ID NO:162); 

TGTGCCTGGC (SEQ ID NO: 164); 

TGCCTGGCTA (SEQ ID NO: 166); 

CCTGGCTAGA (SEQ ID NO: 168); 

TGGCTAGAAG (SEQ ID NO: 170); 

GCTAGAAGCA (SEQ ID NO:172); 

TAGAAGCACA (SEQ ID NO:174); 

GAAGCACAAG (SEQ ID NO:176); 
AGCACAAGAG (SEQ ID NO: 178); 

CACAAGAGGA (SEQ ID NO: 18 0) ; 

CAAGAGGAGG ( SEQ ID NO : 1 8 2 ) ; 
AGAGGAGGAA (SEQ ID NO: 184); 
AGGAGGAAGA (SEQ ID NO: 186); 
GAGGAAGAGG (SEQ ID NO: 188) ; 
GGAAGAGGTG ( SEQ ID NO : 1 9 0 ) ; 
AAGAGGTGGG (SEQ ID NO: 192); 
GAGGTGGGTT (SEQ ID NO: 194); 
GGTGGGTTTT (SEQ ID NO: 196); 
TGGGTTTTCC (SEQ ID NO: 198) ; 
GGTTTTCCAG (SEQ ID NO: 2 00); 
TTTTCCAGTC (SEQ ID NO:202); 
TTCCAGTCAC (SEQ ID NO: 2 04); 
CCAGTCACAC (SEQ ID NO:206); 
AGTCACACCT (SEQ ID NO:208); 
TCACACCTCA (SEQ ID NO: 2 10); 
ACACCTCAGG (SEQ ID NO: 212) ; 
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CAATACAGCA (SEQ ID NO: 139); 

ATACAGCAGC (SEQ ID NO:141); 

ACAGCAGCTA (SEQ ID NO: 143); 

AGCAGCTAAC (SEQ ID NO:145); 

CAGCTAACAA (SEQ ID NO:147); 

GCTAACAATG (SEQ ID NO:149); 

TAACAATGCT (SEQ ID NO:151); 

ACAATGCTGC (SEQ ID NO:153); 

AATGCTGCTT (SEQ ID NO:155); 

TGCTGCTTGT (SEQ ID NO:157); 

CTGCTTGTGC (SEQ ID NO:159); 

GCTTGTGCCT (SEQ ID NO:161); 

TTGTGCCTGG (SEQ ID NO: 163) ; 

GTGCCTGGCT (SEQ ID NO:165); 

GCCTGGCTAG (SEQ ID NO:167); 

CTGGCTAGAA (SEQ ID NO:169); 

GGCTAGAAGC (SEQ ID N0:171); 

CTAGAAGCAC (SEQ ID NO:173); 
AGAAGCACAA (SEQ ID NO:175); 
AAGCACAAGA (SEQ ID NO: 177); 
GCACAAGAGG (SEQ ID NO:179); 
ACAAGAGGAG (SEQ ID NO:181); 
AAGAGGAGGA (SEQ ID NO:183); 
GAGGAGGAAG (SEQ ID NO : 1 8 5 ) ; 
GGAGGAAGAG (SEQ ID NO:187); 
AGGAAGAGGT (SEQ ID NO:189); 
GAAGAGGTGG (SEQ ID NO: 191); 
AGAGGTGGGT (SEQ ID NO:193); 
AGGTGGGTTT (SEQ ID NO: 195); 
GTGGGTTTTC (SEQ ID NO: 197); 
GGGTTTTCCA (SEQ ID NO:199); 
GTTTTCCAGT (SEQ ID NO:201); 
TTTCCAGTCA (SEQ ID NO:203); 
TCCAGTCACA (SEQ ID NO:205); 
CAGTCACACC (SEQ ID NO:207); 
GTCACACCTC (SEQ ID NO:209); 
CACACCTCAG (SEQ ID NO:211); 
CACCTCAGGT (SEQ ID NO:213); 
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ACCTCAGGTA 


(SEQ 


ID 


NO: 


214) ; 


CCTCAGGTAC 


(SEQ 


ID 


NO: 


215) 


CTCAGGTACC 


(SEQ 


ID 


NO: 


216) ; 


TCAGGTACCT 


(SEQ 


ID 


NO: 


217) 


CAGGTACCTT 


(SEQ 


ID 


NO: 


218) ; 


AGGTACCTTT 


(SEQ 


ID 


NO: 


219) 


GGTACCTTTA 


(SEQ 


ID 


NO: 


220) ; 


GTACCTTTAA 


(SEQ 


ID 


NO: 


221) 


TACCTTTAAG 


(SEQ 


ID 


NO: 


222) ; 


ACCTTTAAGA 


(SEQ 


ID 


NO: 


223) 


CCTTTAAGAC 


(SEQ 


ID 


NO: 


224) ; 


CTTTAAGACC 


(SEQ 


ID 


NO: 


225) 


TTTAAGACCA 


(SEQ 


ID 


NO: 


226) ; 


TTAAGACCAA 


(SEQ 


ID 


NO: 


227) 


TAAGACCAAT 


(SEQ 


ID 


NO: 


228) ; 


AAGAC CAATG 


(SEQ 


ID 


NO: 


229) 


AGACCAATGA 


(SEQ 


ID 


NO: 


230) ; 


GAC CAATGAC 


(SEQ 


ID 


NO: 


231) 


ACCAATGACT 


(SEQ 


ID 


NO: 


232) ; 


CCAATGACTT 


(SEQ 


ID 


NO: 


23 3) 


CAATGACTTA 


(SEQ 


ID 


NO: 


234) ; 


AATGACTTAC 


(SEQ 


ID 


NO: 


235) 


ATGACTTACA 


(SEQ 


ID 


NO: 


236) ; 


TGACTTACAA 


(SEQ 


ID. 


NO: 


237) 


GACTTACAAG 


(SEQ 


ID 


NO: 


238) ; 


ACTTACAAGG 


(SEQ 


ID 


NO: 


239) 


CTTACAAGGC 


(SEQ 


ID 


NO: 


24 0) ; 


TTACAAGGCA 


(SEQ 


ID 


NO: 


241) 


TACAAGGCAG 


(SEQ 


ID 


NO: 


242) ; 


ACAAGGCAGC 


(SEQ 


ID 


NO: 


243) 


CAAGGCAGCT 


(SEQ 


ID 


NO: 


244) ; 


AAGGCAGCTG 


(SEQ 


ID 


NO: 


245) 


AGGCAGCTGT 


(SEQ 


ID 


NO: 


246) ; 


GGCAGCTGTA 


(SEQ 


ID 


NO: 


247) 


GCAGCTGTAG 


(SEQ 


ID 


NO: 


248) ; 


CAGCTGTAGA 


(SEQ 


ID 


NO: 


249) 


AGCTGTAGAT 


(SEQ 


ID 


NO: 


250) ; 


GCTGTAGATC 


(SEQ 


ID 


NO: 


251) 


CTGTAGATCT 


(SEQ 


ID 


NO: 


252) ; 


TGTAGATCTT 


(SEQ 


ID 


NO: 


253) 


GTAGATCTTA 


(SEQ 


ID 


NO: 


254) ; 


TAGATCTTAG 


(SEQ 


ID 


NO: 


255) 


AGATCTTAGC 


(SEQ 


ID 


NO: 


256) ; 


GATCTTAGCC 


(SEQ 


ID 


NO: 


257) 


ATCTTAGCCA 


(SEQ 


ID 


NO: 


258) ; 


TCTTAGCCAC 


(SEQ 


ID 


NO: 


259) 


CTTAGCCACT 


(SEQ 


ID 


NO: 


260) ; 


TTAGCCACTT 


(SEQ 


ID 


NO: 


261) 


TAGCCACTTT 


(SEQ 


ID 


NO 


262) ; 


AGCCACTTTT 


(SEQ 


ID 


NO- 


263) 


GCGACTTTTT 


(SEQ 


ID 


NO 


264) ; 


CCACTTTTTA 


(SEQ 


ID 


NO 


265) 


CACTTTTTAA 


(SEQ 


ID 


NO 


:266) ; 


ACTTTTTAAA 


(SEQ 


ID 


NO 


:267) 


CTTTTTAAAA 


(SEQ 


ID 


NO 


:268) ; 


TTTTTAAAAG 


(SEQ 


ID 


NO 


:269) 


TTTTAAAAGA 


(SEQ 


ID 


NO 


:270) ; 


TTTAAAAGAA 


(SEQ 


ID 


NO 


:271) 


TTAAAAGAAA 


(SEQ 


ID 


NO 


:272) ; 


TAAAAGAAAA 


(SEQ 


ID 


NO 


:273) 


AAAAGAAAAG 


(SEQ 


ID 


NO 


:274) ; 


AAAGAAAAGG 


(SEQ 


ID 


NO 


:275) 


AAGAAAAGGG 


(SEQ 


ID 


NO 


:276) ; 


AGAAAAGGGG 


(SEQ 


ID 


NO 


:277) 


GAAAAGGGGG 


(SEQ 


ID 


NO 


:278) ; 


AAAAGGGGGG 


(SEQ 


ID 


NO 


:279) 


AAAGGGGGGA 


(SEQ 


ID 


NO 


:280) ; 


AAGGGGGGAC 


(SEQ 


ID 


NO 


:281) 


AGGGGGGACT 


(SEQ 


ID 


NO 


:282) ; 


GGGGGGACTG 


(SEQ 


ID 


NO 


:283) 


GGGGGACTGG 


(SEQ 


ID 


NO 


:284) ; 


GGGGACTGGA 


(SEQ 


ID 


NO 


:285) 


GGGACTGGAA 


(SEQ 


ID 


NO 


:286) ; 


GGACTGGAAG 


(SEQ 


ID 


NO 


:287) 


GACTGGAAGG 


(SEQ 


ID 


NO 


:288) ; 


ACTGGAAGGG 


(SEQ 


ID 


NO 


:289-) 
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CTGGAAGGGC (SEQ ID NO: 2 90) 
GGAAGGGCTA (SEQ ID NO: 292) 
AAGGGCTAAT (SEQ ID NO: 2 94) 
GGGCTAATTC (SEQ ID NO: 2 96) 
GCTAATTCAC (SEQ ID NO: 2 98) 
TAATTCACTC (SEQ ID NO: 3 00) 
ATTCACTCCC (SEQ ID NO: 3 02) 
TCACTCCCAA (SEQ ID NO: 3 04) 
ACTCCCAAAG (SEQ ID NO: 3 06) 
TCCCAAAGAA (SEQ ID NO: 3 08) 
CCAAAGAAGA (SEQ ID NO: 310) 
AAAGAAGACA (SEQ ID NO: 312) 
AGAAGACAAG (SEQ ID NO: 3 14) 
AAGACAAGAT (SEQ ID NO: 316) 
GACAAGATAT (SEQ ID NO: 318) 
CAAGATATCC (SEQ ID NO: 3 20) 
AGATATCCTT (SEQ ID NO: 322) 
ATATCCTTGA (SEQ ID NO: 3 24) 
ATCCTTGATC (SEQ ID NO: 3 26) 
CCTTGATCTG (SEQ ID NO: 328) 
TTGATCTGTG (SEQ ID NO: 33 0) 
GATCTGTGGA (SEQ ID NO: 332) 
TCTGTGGATC (SEQ ID NO: 334) 
TGTGGATCTA (SEQ ID NO: 336), 
TGGATCTACC (SEQ ID NO: 338), 

GATCTACCAC (SEQ ID NO: 340), 

TCTACCACAC (SEQ ID NO: 342), 

TACCACACAC (SEQ ID NO: 344) ; 

CCACACACAA (SEQ ID NO:346); 

ACACACAAGG (SEQ ID NO: 348); 

ACACAAGGCT (SEQ ID NO: 3 50); 

ACAAGGCTAC (SEQ ID NO: 3 52) ; 

AAGGCTACTT (SEQ ID NO: 3 54); 

GGCTACTTCC (SEQ ID NO: 356) ; 

CTACTTCCCT (SEQ ID NO: 3 58) ; 

ACTTCCCTGA (SEQ ID NO: 3 60) ; 

TTCCCTGATT (SEQ ID NO: 3 62); 

CCCTGATTGG (SEQ ID NO: 3 64); 
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TGGAAGGGCT 




ID 


MO 


• X / 


GAAGGGCTAA 




X XV 


NO 


• Z J J / 






XJJ 


Mn 


. OQt;\ 


x t\£\ 1 X LA 




TT\ 
1U 


Mn 
XMvJ 


: z y / ; 


L 1 x 1 Lr\L x 




Tn 
XJJ 


JNU 


» T Q Q \ 

: z y y } 


A ATT PA PITT* 
X X ^i~lL ILL 




X JJ 


JNU 


: JU1J 


X X V_riV_ X LLLn 




XJJ 


wn 


• "3 Pk *5 \ 




( QPn 


Tn 


Mn 


• ^ n k \ 


PTPPPA&APA 
X LLLrvirtvjri 


V oJiLj 


X JJ 


Mn 


. JU / ) 


PPPA A AHA AP 


/ opn 


Tn 

X JJ 


Mn 


i j uy / 


PA A APA APAP 


( Qpn 


Tn 

XU 


Mn 


loll] 




/ QPn 


Tn 

X JJ 


JNU 


: J13J 




/ QUA 


Tn 

XJJ 


MO 
JNU 


J Xd ; 


APAPA APATA 


( QPn 


Tn 

X u 


Mn 


Jl/j 


APAAGATATP 


/ qpn 


Tn 

XJJ 


Mn - 


O X j ) 


AAPATATPPT 


( Qpn 

^ OE*Lj 


Tn 

X u 


Mn - 


-3 Z X / 


PATATPPTTP 




Tn 

XJJ 


Mn . 
JNU : 


*3 O *3 \ 


TATPPTTCAT 
1/iJ.LLl 1 Uri 1 




Tn 
x jj 


Mn . 
iNU : 


~i O C \ 


1 LL X x LjAI L 1 




XJJ 


JNU : 


-) <-A 1-7 \ 

327 ) 


PTTPiTPTPT 
LI 1 \jJ\ ILlul 






INU : 


32 9; 


TPATPTPTPP 
x LjH. ILILI LjLj 




Tn 

X JJ 


JNU : 




A T PTPTPP a T 
nILIuI X 




Tn 

XJJ 


wn . 
JNU : 


"3 *3 *5 \ 
J J J j 


PTPTPPATPT 

n— X Vj7 X \J\jr\ X L X 


/ Qpn 


Tn 


Mn . 
JNU : 


O O C \ 


PTPPATPTAr* 
vj X LrLf/-i 1 L 1 HL 


/ opn 


Tn 


Mn . 
JNU : 


337) 


nnziTPTappa 

LrLr/\X L 1 HLLR 


/ opn 


XJJ 


NO : 


33 9; 


ATPTappapa 

H1L liiLLiiLA 


/ ccn 


ID 


NU : 


341 ; 


L 1 HLLiiLnLA 


/ cpn 


XJJ 


JNO : 


3 43) 


APPz\r , ivr i Ar , A 

rlL LrlLhLHLH 


( Qpn 


Tn 

XJJ 


Kin . 
INU : 


34 5) 


PAPAPAPA AH 


( Qpn 


Tn 

XJJ 


Mn . 
JNU : 


"3 A ""7 \ 


PAPAPAAPPP 


f cpn 


Tn 

XJJ 


Mn * 
jnu : 


j4" j 


CACAAGGPTA 


f Qpn 


X LJ 


Mn « 


OCT 1 


CAAGGCTAPT 




Tn 

XXJ 


Mn • 




AGGCTAPTTP 

*w X X X Via* 




Tn 

X xJ 


Mn - 


J JO) 


GCTACTTCCC 




TD 

X LJ 


l^\J . 


JJ / / 


xi-iL 1 X LLL X L 


/ CPA 


XU 


JNU : 


^ r rt \ 

359) 


CTTCCCTGAT 


(SEQ 


ID 


NO: 


361) 


TCCCTGATTG 


(SEQ 


ID 


NO: 


363) , 


CCTGATTGGC 


(SEQ 


ID 


NO: 


365) , 
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CTGATTGGCA 


(SEQ 


ID 


NO: 


366) 


GATTGGCAGA 


(SEQ 


ID 


NO: 


368) 


TTGGCAGAAC 


(SEQ 


ID 


NO: 


370) 


GGCAGAACTA 


(SEQ 


ID 


NO: 


372) 


CAGAACTACA 


(SEQ 


ID 


NO: 


374) 


GAACTACACA 


(SEQ 


ID 


NO: 


376) 


ACTACACACC 


(SEQ 


ID 


NO: 


378) 


T AC AC AC C AG 


(SEQ 


ID 


NO: 


380) 


CACACCAGGG 


(SEQ 


ID 


NO: 


382) 


CACCAGGGCC 


(SEQ 


ID 


NO: 


384) 


CCAGGGCCAG 


(SEQ 


ID 


NO: 


386) 


AGGGCCAGGG 


(SEQ 


ID 


NO: 


388) 


GGCCAGGGGT 


(SEQ 


ID 


NO : 


390 ) 


CCAGGGGTCA 


(SEQ 


ID 


NO : 


392 ) 


AGGGGTCAGA 


(SEO 


ID 


NO : 


394 ) 


GGGTCAGATA 


(SEQ 


ID 


NO : 


396) 


G T C AG AT AT C 


(SEQ 


ID 


NO: 


398) 


CAGATATCCA 


(SEQ 


ID 


NO: 


400) 


GATATCCACT 


(SEQ 


ID 


NO: 


402) 


TATCCACTGA 


(SEQ 


ID 


NO: 


404) 


TCCACTGACC 


(SEQ 


ID 


NO: 


4 06) 


CACTGACCTT 


(SEQ 


ID 


NO : 


408 ) 


CTGACCTTTG 


f SEO 


TD 


NO * 


*± JL \J t 


GAPOTTTOGA 


/ qpo 


IT) 


NO • 


*± X ) 


*w L- X X X wlxrl X W 


qpo 


TV) 
xxj 




A 1 A \ 
*x X *± / 


X X X VJTVJ^l 1 VJTVJ X 




x u 


MO * 




TGGATGGTGC 


(SEQ 


ID 


NO : 


418 ) 


GATGGTGCTA 


(SEQ 


ID 


NO : 


420) 


TGGTGCTACA 


(SEQ 


ID 


NO: 


422) 


GTGCTACAAG 


(SEQ 


ID 


NO: 


424) 


GCTACAAGCT 


(SEQ 


ID 


NO: 


426) 


TACAAGCTAG 


(SEQ 


ID 


NO: 


428) 


OAAGPTAOTA 




x u 


NO • 

IN W . 


a n ) 

" j u y 


AGCTAGTACC 


(SEQ 


ID 


NO: 


432) 


C TAG T AC C AG 


(SEQ 


ID 


NO: 


434) 


AGTACCAGTT 


(SEQ 


ID 


NO: 


436) 


TACCAGTTGA 


(SEQ 


ID 


NO: 


438) 


CCAGTTGAGC 


(SEQ 


ID 


NO: 


440) 
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TGATTGGCAG 


(SEQ 


ID 


NO: 


3 67) ; 


ATTGGCAGAA 


(SEQ 


ID 


NO: 


369) ; 


TGGCAGAACT 


(SEQ 


ID 


NO: 


371) ; 


GCAGAACTAC 


(SEQ 


ID 


NO: 


3 73) ; 


AGAACTACAC 


(SEQ 


ID 


NO: 


375) ; 


AACTACACAC 


(SEQ 


ID 


NO: 


377) ; 


CTACACACCA 


(SEQ 


ID 


NO: 


379) ; 


ACACACCAGG 


(SEQ 


ID 


NO 


381) ; 


ACACCAGGGC 


(SEQ 


ID 


NO 


3 83) ; 


ACCAGGGCCA 


(SEQ 


ID 


NO: 


385) ; 


CAGGGCCAGG 


(SEQ 


ID 


NO: 


3 87) ; 


GGGCCAGGGG 


(SEQ 


ID 


NO: 


389) ; 


GCCAGGGGTC 


(SEQ 


ID 


NO: 


3 91) ; 


CAGGGGTCAG 


(SEQ 


ID 


NO: 


3 93) ; 


GGGGTCAGAT 


(SEQ 


ID 


NO: 


3 95) ; 


GGT CAGAT AT 


(SEQ 


ID 


NO: 


3 97) ; 


TCAGATATCC 


(SEQ 


ID 


NO: 


3 99) ; 


AGATATCCAC 


(SEQ 


ID 


NO: 


4 01) ; 


ATATCCACTG 


(SEQ 


ID 


NO: 


403) ; 


ATCCACTGAC 


(SEQ 


ID 


NO: 


405) ; 


CCACTGACCT 


(SEQ 


ID 


NO: 


4 07) ; 


ACTGACCTTT 


(SEQ 


ID 


NO: 


409) ; 


TGACCTTTGG 


(SEQ 


ID 


NO: 


411) ; 


AC CTTTGGAT 


(SEQ 


ID 


NO- 


413 ) ; 


PTTTOGATGG 
X X X VJ^JiS. X \J\J 


(SEQ 


ID 


NO 


415) : 


TTGGATGGTG 

X X VJVJXTt X VJ X W 


(SEQ 


ID 


NO 


417) ; 


GGATGGTGCT 


(SEO 


ID 


NO 


419) ; 


ATGGTGCTAC 


(SEQ 


ID 


NO 


421) ; 


GGTGCTACAA 


(SEQ 


ID 


NO 


423) ; 


TGCTACAAGC 


(SEQ 


ID 


NO 


:425) ; 


CTACAAGCTA 


(SEQ 


ID 


NO 


:427) ; 


ACAAGCTAGT 


(SEQ 


ID 


NO 


:429) ; 


AAGCTAGTAC 


(SEQ 


ID 


NO 


:431) ; 


GCTAGTACCA 


(SEQ 


ID 


NO 


:433) ; 


TAGTACCAGT 


(SEQ 


ID 


NO 


:435) ; 


GTACCAGTTG 


(SEQ 


ID 


NO 


:437) ; 


ACCAGTTGAG 


(SEQ 


ID 


NO 


:439) ; 


CAGTTGAGCC 


(SEQ 


ID 


NO 


:441) ; 
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AfTTP AOOOA 

1 X Lj/\b7b-U/\ 


f CPP) 
V oXjU. 


X. u 


IN (J 


• A A 1 \ * 

: ft ft z } f 


1 1 Lxtt.b7U U-H.b7.rt. 


/ ot?n 

\ OCjU. 


± u 




A A A } * 
> ft ft ft } , 


o a o o o a o a t a 

bri-ib? L- U/\Lxrt I rt. 




XL) 




A A C \ . 

ft ft o ) ; 


oooapataao 


z' ct?n 


T T\ 
± U 


MP « 


A A Q \ ■ 

ft ft o ; ; 


U-rtbxrt. 1 rlrtUU 1 


/ OT70 
[ £>fcjU. 


rn 
1JJ 


M(^ . 

jnu : 


ft 3 U / / 


oat a a opt a o 

vj/"i. 1 .rtrt.VjVj 1 <rt.U 


/ QT7P 


J.1J 


MP « 


A c: o \ , 
ft o z / ; 


T A A POT A O A A 
1 rinuU 1 rtbirt-rt. 


( QtPPl 
^ oHjU, 


TFl 

±xJ 


MP - 


A c: A \ • 
ft _> ft ) , 




V OH.U. 


TTi 


MP • 


ft D o J f 




f QPO 


TPi 


MP • 


a c: p \ . 

ft D C5 ) / 


zv p a a o a ooo o 


(£>H.U. 


TPi 


MP • 


ft o u } ; 


a apappppa a 

■rt-rt.vjrt.V7 U U rtrt. 


( bliU. 


TFi 


ATP . 


ft b Z ; ; 


nivnnrr'a at* a 

birt-bibjU b.rt_H. 1 rt. 


K oejO 


T Tl 


JNU : 


4 b ft ; ; 


nrnra at a a a 

UUb-v — rtrt. x AAA 


/ CXPO 
V Oll,Q 




Mrs , 
INU : 


ft b b ) ; 


pp a at a a apo 

U Urtrt. X i\jt\jt\\JJ\J3 


/ crn 

\ 


TTt 




A C Q \ * 

ft b o ; ; 


A AT A A APPAO 
I AAAbjb* Abj 


/ OTTO 


XL) 


isJU : 


4 7 U / ; 


1 AAAULjAUAb? 


(SEQ 


T "Pi 

ID 


iMO : 


4 72) ; 


AAbi U AO AO AA 


(SEQ 


ID 


NO : 


4 74 j ; 


OO A O A O A a o a 

OOAOAGAACA 


( SEQ 


ID 


NO : 


4 76) ; 


a n a n a a Pa oo 
AO AO AA LALL 


/ cn»n 
( SEQ 


ID 


NO : 


4 78) ; 




/ OTTO 


T T\ 
±U 


NO : 


4 o U ) ; 


A A PA PP A PPT 
rtJA UrtUb-rtUU 1 


( oEQ 


TPl 


M/^ * 


A Q O \ . 

4 cs z ) ; 


P APPAOPTTP 




T n 
X U 


MO * 
iNVj . 


ft o ft ) ; 


PPAPPTTPTT 
L_ LAb L, X i bx 1 1 


/ CT7P 

\ OE*Q 


Tn 

X JJ 


MP - 

NU : 


A Q C \ . 

4 o b ) ; 


APPTTPTT AP 
.H.b7U I x bx I iAL 


/ CPO 


XJJ 


NU : 


4 o o ) ; 


PTTOTT A P A P 
U X lul 1 .rt.b~rt.b- 




Tn 
X u 


MP - 


ft y u ^ ; 


TPTT AP A PPP 


/ CTTP 

VoiLU^ 


Tn 
x u 


MP * 
INU I 


ft i7 z j ; 


TTAPAPPPTO 


( o£iQ 


Tn 


MP • 
XNVJ . 


ft jft y , 


APAPPPTOTO 


/ CT?P 


Tn 

X D 


MP • 

jnu : 


ft y b ) ; 


APPPTPTPAO 
-rtLUU ibl Lxrtbr 




Tn 

XJJ 


MO . 

inu : 


/I QQ \ , 

4 y o ^ ; 


PPTOTP APPP 




Tn 

XJJ 


MP . 


o u u ) ; 


totoappptp 

1 bj J. bxAUUU 1 bj 


^ QT7P\ 


Tn 


MO . 
JNU I 


cno\ . 
jUZ j ; 


TO APPPTOP A 


V OX^iV^ 


Tn 

XJJ 


MO . 
iN\J . 


o u 4 ; ; 


A OPPTP P A TP 
rtb^b-b- x bjUA X o 


iisJiiQ 


Tn 

XJJ 


MO . 
INU t 


c n cz \ . 
DUoj ; 


PPTOP A TPP A 
Lb. 1 bJbAx bun 


Vi3£iQ 


Tn 

XJJ 


MO . 

inu : 


DUoj ; 


TGCATGGAAT 


(SEQ 


ID 


NO : 


510) ; 


CATGGAATGG 


(SEQ 


ID 


NO: 


512) ; 


TGGAATGGAT 


(SEQ 


ID 


NO: 


514) ; 


GAATGGATGA 


(SEQ 


ID 


NO: 


516) ; 



194 - 



OTTO A O r*0 A C 


/ CT70 


Tn 

XJJ 


MO 
INU 


- A A "3 \ * 

: ft ft _3 ) f 


TO AO OO AO AT 
I bAbLLAbA i 


( cun 
\ oi^Ui 


Tn 

XJJ 


MO 
JNU 


: 4 4 d ) ; 


APPP AP AT A A 




Tn 
XJJ 


MO 
JNU 


• A A H \ . 

: 4 4 / ) ; 


OO AO ATA AOO 
U Abi A 1 AAbrbi 


/ CT^O 
^ OCiLJ 


Tn 
XL) 


MO 
NU 


. A A Q \ . 

: 4 4 y ) ; 


AO ATA AOOT A 
AbA 1 AAbb 1 A 


( crn 


Tn 
XJJ 


MO 

INU 


: 4 o I ) ; 


AT A AOOT AO A 


/ ct?o 


Tn 

XJJ 


MO 
JNU 


. a c: *a \ . 
1 ft j J / ; 


A APOTAPA AP 


V oHjU. 


Tn 

XJJ 


MO 
JNU 


„ A C C 1 . 

: 4 d d J ; 


PPT APA AO AO 




Tn 
xu 


MP 
INU 


. ft D / / , 


TAP A APAPPO 




Tn 

XJJ 


MP 
JNU 


. A C Q \ . 


PA APAPOOOA 
\j AAb AVj\j \— LA 


/ OTTO 


Tn 

XJJ 


MP 
INU 


i 4 o x / ; 


AO A OOOO A AT 


/ CTTP 


Tn 

XJJ 


MP 
JNU 


; td j j ; 


AOOOOAAHHA A 
AbibU LAA I AA 


/ nT7>n 

lohiQ 


T "Hi 
XJJ 


MO 

NU 


! 4b j ) ; 


OOO A A TA A AO 
bLLAAlAAAb 




XU 


MO 

NO 


a an \ . 
4b/) ; 


OA ATA A AOO A 
LAA I AAAvjVjA 




t n 
XJJ 


MO 

NU 


4 b y ) ; 


ATA AAOOAOA 

A I AAAvjoAL? A 


(SEQ 


XX) 


Mn . 
NO 


4 / 1 ) ; 


A A AOOAOAOA 
AAALjCjALjALjA 


V oh,Q 


ID 


MO 

NO ; 


4 73) ; 


AGGAGAGAAC 


( SEQ 


ID 


NO : 


475) / 


OAOAOA A/"»AO 

(jAOACjAAOAO 


(SEQ 


ID 


NO : 


4 77) ; 


OAOA A OA OOA 

bAbAALAL OA 


( SEQ 


ID 


NO : 


/I n ft \ 

4 7 9) ; 


o a a o a r*r*7i r*r* 
bAALAL LAb O 


( bby 


ID 


MO . 

NO : 


4 81) ; 


A OA OOA OOTT 
ALALLAbL 1 1 


vSh,Q 


Tn 
ID 


MO * 

INU : 


yl Q 1 \ . 

4 0 0 ) ; 


aooapotTpt 

ALLAbLi 1 Lj 1 


/ OTTO 
VO£*Q 


Tn 
XJJ 


MO - 

inu : 


A Q c ^ . 
4 0 d ) ; 


P APPTTPTT A 
LALr L. X I Lj 1 1 A 


/ OTTO 


Tn 
ID 


MO . 

NO : 


4 0 / ) ; 


PPTTPTTAPA 
bLl ILjI iALA 




Tn 
XL) 


MO . 

NU : 


4 0 y ) ; 


TTPTTAO APP 
i i b 1 1ALALL 




Tn 
ID 


MO . 

NU : 


4 y I ) ; 


PTT AP APPPT 
bj X 1ALALLL1 




Tn 

ID 


MO . 

inu : 


A Q *3 \ . 

4 y j ; ; 


TAPAPPPTPT 
1ALALLL lbi 


(i>hiQ 


Tn 
x u 


MO . 

NU : 


4 y b ) ; 


pa r i r % r ir rr' r rr* a 

LALLL lbJ. 


/ QT?0 


Tn 

XL) 


MO . 

jnu : 


4 y / ) ; 


LLLlbl LxH\jL 


(ijJiiQ 


Tn 
XL) 


MO • 

NO : 


4 y y ) ; 


OTPTPAOOOT 
U 1 bj X LaAbrLL I 


/ O T?0 


ID 


MO . 

NO : 


bOl) ; 


PTPAPPPTPP 
bj X bj/\VjUL 1 brL 




Tn 

XX) 


MO * 

nu : 


0 u J ) ; 


PAPPPTPP AT 




Tn 
XJJ 


MO . 

NU : 


enc \ . 

jUb) ; 


PPPTPPATPP 
brLL X bLAl L7L7 




Tn 
ID 


MO . 

NO : 


b U / ) ; 


PTPPATPPA A 
b- X VjUA X bbAA 




Tn 
ID 


MO . 

NO : 


buy) ; 


GCATGGAATG 


(SEQ 


ID 


NO: 


511) ; 


ATGGAATGGA 


(SEQ 


ID 


NO: 


513) ; 


GGAATGGATG 


(SEQ 


ID 


NO: 


515) ; 


AATGGATGAC 


(SEQ 


ID 


NO: 


517) ; 
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ATGGATGACC 


(SEQ 


ID 


NO: 


518) ; 


TGGATGACCC 


(SEQ 


ID 


NO: 


519) 


GGATGACCCT 


(SEQ 


ID 


NO: 


520) ; 


GATGACCCTG 


(SEQ 


ID 


NO: 


521) 


ATGACCCTGA 


(SEQ 


ID 


NO: 


522) ; 


TGACCCTGAG 


(SEQ 


ID 


NO: 


523) 


GACCCTGAGA 


(SEQ 


ID 


NO : 


524) ; 


ACCCTGAGAG 


(SEQ 


ID 


NO: 


525) 


CCCTGAGAGA 


(SEQ 


ID 


NO : 


526) ; 


CCTGAGAGAG 


(SEQ 


ID 


NO: 


527) 


CTGAGAGAGA 


(SEQ 

X *— ^ J —' ML 


ID 


NO : 


528) ; 


TGAGAGAGAA 


(SEQ 


ID 


NO: 


529) 


GAGAGAGAAG 


(SEQ 


ID 


NO: 


530) ; 


AGAGAGAAGT 


(SEQ 


ID 


NO: 


531) 


GAGAGAAGTG 


(SEQ 


ID 


NO: 


532) ; 


AGAGAAGTGT 


(SEQ 


ID 


NO: 


533) 


GAGAAGTGTT 


(SEQ 


ID 


NO: 


534) ; 


AGAAGTGTTA 


(SEQ 


ID 


NO: 


535) 


GAAGTGTTAG 


(SEQ 


ID 


NO: 


536) ; 


AAGTGTTAGA 


(SEQ 


ID 


NO: 


537) 


AGTGTTAGAG 


(SEQ 


ID 


NO: 


538) ; 


GTGTTAGAGT 


(SEQ 


ID 


NO: 


53 9) 


TGTTAGAGTG 


(SEQ 


ID 


NO: 


540) ; 


GTTAGAGTGG 


(SEQ 


ID 


NO: 


541) 


TTAGAGTGGA 


(SEQ 


ID 


NO : 


542) ; 


TAGAGTGGAG 


(SEQ 


ID 


NO: 


543) 


AGAGTGGAGG 


(SEO 


ID 


NO : 


544 ) ; 


GAGTGGAGGT 


(SEQ 


ID 


NO: 


545) 


AGTGGAGGTT 


I SEO 


ID 

X, KJ 


NO • 


546) ; 


GTGGAGGTTT 


(SEQ 


ID 


NO: 


547) 


TGGAGGTTTG 


(SEQ 


ID 


NO : 


54 8) ; 


GGAGGTTTGA 


(SEQ 


ID 


NO: 


549) 


GAGGTTTGAC 


(SEQ 


ID 


NO : 


55 0) ; 


AGGTTTGACA 


(SEQ 


ID 


NO: 


551) 


GGTTTGACAG 


(SEQ 


ID 


NO : 


552) ; 


GTTTGACAGC 


(SEQ 


ID 


NO: 


553) 


TTTGACAGCC 


(SEO 


ID 


NO : 


554 ) ; 


TTGACAGCCG 


(SEQ 


ID 


NO: 


555) 


TGACAGCCGC 


(SEO 


ID 


NO : 


5 56) ; 


GACAGCCGCC 


(SEQ 


ID 


NO: 


557) 


ACAGCCGCCT 




ID 


NO : 


558) ; 


CAGCCGCCTA 


(SEQ 


ID 


NO: 


559) 






X ±J 




*j \j \j 1 / 


GCCGCCTAGC 


(SEO 


ID 


NO : 


561) 




f QPPl 


xu 








(SEO 


ID 


NO : 


563 ) 


VjL-^- J. ±\\j\^Jr\. X X 


V OHV 


X LJ 


KID • 


JO 4 ! / / 


CCTAGCATTT 


(SEQ 


ID 


NO : 


565) 


X /\vjj V_-wHl 1 1 1L 




in 

X i-J 


NO • 


_J D D / f 


TAGGATTTCA 


(SEQ 


ID 


NO : 


567) 


AOPATTTPAT 




TD 

X J-/ 


NO • 


568 ) • 


GCATTTCATC 


(SEQ 


ID 


NO: 


569) 


CAT T T CAT C A 


(SEQ 


ID 


NO : 


570) ; 


ATTTCATCAC 


(SEQ 


ID 


NO: 


571) 


TTTCATCACG 


(SEQ 


ID 


NO : 


572) ; 


TTCATCACGT 


(SEQ 


ID 


NO: 


573) 


TCATCACGTG 


(SEQ 


ID 


NO: 


574) ; 


CATCACGTGG 


(SEQ 


ID 


NO: 


575) 


ATCACGTGGC 


(SEQ 


ID 


NO: 


576) ; 


TCACGTGGCC 


(SEQ 


ID 


NO: 


577) 


CACGTGGCCC 


(SEQ 


ID 


NO: 


578) ; 


ACGTGGCCCG 


(SEQ 


ID 


NO: 


579) 


CGTGGCCCGA 


(SEQ 


ID 


NO: 


580) ; 


GTGGCCCGAG 


(SEQ 


ID 


NO: 


581) 




\, O Hi V 


X U 




-j 0 } t 


GG C C CGAG AG 


(SEQ 


ID 


NO : 


583 ) 


GCCCGAGAGC 


(SEQ 


ID 


NO: 


584) ; 


CCCGAGAGCT 


(SEQ 


ID 


NO: 


585) 


CCGAGAGCTG 


(SEQ 


ID 


NO: 


586) ; 


CGAGAGCTGC 


(SEQ 


ID 


NO 


587) 


GAGAGCTGCA 


(SEQ 


ID 


NO: 


588) ; 


AGAGCTGCAT 


(SEQ 


ID 


NO 


:589) 


GAGCTGCATC 


(SEQ 


ID 


NO: 


5 9 0); 


AGCTGCATCC 


(SEQ 


ID 


NO 


:591) 


GCTGCATCCG 


(SEQ 


ID 


NO: 


592) ; 


CTGCATCCGG 


(SEQ 


ID 


NO 


:593) 
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TGCATCCGGA 


(SEO 


ID 


NO : 


594 ) • 






TD 


NT) * 


^ Q ^ ) • 


CATCCGGAGT 


(SEQ 


ID 


NO : 


5 96) ; 


ATCOCJGAnTA 


( cirri 

V O Hi v 


TV) 






TCCGGAGTAC 


(SEO 


ID 


NO - 


~j <j i f 




f qpn 


TFl 


jnu : 


3 y y ; , 


CGGAGTACTT 


(SEO 




NO * 

XNV> ■ 


\j \j \j i f 


uoiiu J. ri.v_ 1 1L 


^ oii^ 


XJJ 


jnu : 


bul; / 


GAGTACTTCA 














ID 


JNU : 


6 03) ; 


GTACTTCAAG 


( ^FO 


TD 

X i-J 


NO • 


D U *± / / 


1 ri\_ 1 1 U/i.f-.v.j-fi 




TTl 


JNU : 


6 05) ; 








JNU : 


b (J b / ; 


CTT CAAGAAC 


(SEQ 


ID 


NO : 


607) ; 


TTCAAGAACT 


(SEQ 


ID 


NO: 


608) ; 


TCAAGAACTG 


(SEQ 


ID 


NO: 


609) ; 


CAAGAACTGC 


(SEQ 


ID 


NO: 


610) ; 


AAGAACTGCT 


(SEQ 


ID 


NO: 


611) ; 


AGAACTGCTG 


(SEQ 


ID 


NO: 


612) ; 


GAACTGCTGA 


(SEQ 


ID 


NO: 


613) . 



A strain of HIV-1 according to claim 8 wherein said deletion encompasses at 



least one of the following decanucleotides from the LTR region of HIV-1 strain NL4-3 



GCTTTTTGCC 


(SEQ 


ID 


NO 


:652) ; 


CTTTTTGCCT 


(SEQ 


ID 


NO: 


653) ; 


TTTTTGCCTG 


(SEQ 


ID 


NO 


:654) ; 


TTTTGCCTGT 


(SEQ 


ID 


NO: 


655) ; 


TTTGC CTGTA 


(SEQ 


ID 


NO 


:656) ; 


TTGCCTGTAC 


(SEQ 


ID 


NO: 


657) ; 


TGCCTGTACT 


(SEQ 


ID 


NO 


:658) ; 


GCCTGTACTG 


(SEQ 


ID 


NO: 


659) ; 


CCTGTACTGG 


(SEQ 


ID 


NO 


:660) ; 


CTGTACTGGG 


(SEQ 


ID 


NO: 


661) ; 


TGTACTGGGT 


(SEQ 


ID 


NO 


:662) ; 


GTACTGGGTC 


(SEQ 


ID 


NO: 


663) ; 


TACTGGGTCT 


(SEQ 


ID 


NO 


:664) ; 


ACTGGGTCTC 


(SEQ 


ID 


NO: 


665) ; 


CTGGGTCTCT 


(SEQ 


ID 


NO 


: 666) ; 


TGGGTCTCTC 


(SEQ 


ID 


NO: 


667) ; 


GGGTCTCTCT 


(SEQ 


ID 


NO 


:668) ; 


GGTCTCTCTG 


(SEQ 


ID 


NO: 


669) ; 


GTCTCTCTGG 


(SEQ 


ID 


NO 


:670) ; 


TCTCTCTGGT 


(SEQ 


ID 


NO: 


671) ; 


CTCTCTGGTT 


(SEQ 


ID 


NO 


:672) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


673) ; 


CTCTGGTTAG 


(SEQ 


ID 


NO 


:674) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


675) ; 


CTGGTTAGAC 


(SEQ 


ID 


NO 


:676) ; 


TGGTTAGAC C 


(SEQ 


ID 


NO: 


677) ; 


GGTTAGACCA 


(SEQ 


ID 


NO 


:678) ; 


GTTAGACCAG 


(SEQ 


ID 


NO: 


679) ; 


TTAGACCAGA 


(SEQ 


ID 


NO 


680) ; 


TAGACCAGAT 


(SEQ 


ID. 


NO: 


681) ; 


AGAC CAGATC 


(SEQ 


ID 


NO 


682) ; 


GACCAGATCT 


(SEQ 


ID 


NO: 


683) ; 


ACCAGATCTG 


(SEQ 


ID 


NO: 


684) ; 


CCAGATCTGA 


(SEQ 


ID 


NO: 


685) ; 


CAGATCTGAG 


(SEQ 


ID 


NO: 


686) ; 


AGATCTGAGC 


(SEQ 


ID 


NO: 


687) ; 


GATCTGAGCC 


(SEQ 


ID 


NO: 


688) ; 


ATCTGAGCCT 


(SEQ 


ID 


NO: 


689) ; 


TCTGAGCCTG 


(SEQ 


ID 


NO: 


690) ; 


CTGAGCCTGG 


(SEQ 


ID 


NO: 


691) ; 


TGAGCCTGGG 


(SEQ 


ID 


NO: 


692) ; 


GAGCCTGGGA 


(SEQ 


ID 


NO: 


693) ; 


AGCCTGGGAG 


(SEQ 


ID 


NO: 


694) ; 


GCCTGGGAGC 


(SEQ 


ID 


NO: 


695) ; 


CCTGGGAGCT 


(SEQ 


ID 


NO: 


696) ; 


CTGGGAGCTC 


(SEQ 


ID 


NO: 


697) ; 


TGGGAGCTCT 


(SEQ 


ID 


NO: 


698) ; 


GGGAGCTCTC 


(SEQ 


ID 


NO: 


699) ; 
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GGAGCTCTCT 


(SEQ 


ID 


NU 


. **7 A A \ 

: 7 0 0 ; 


AGCTCTCTGG 


(SEQ 


ID 


AT/""\ 


: / U Z ) 


/-i rp /-i rp /~i rp A A A rn 

CTC 1L1 ULrU 1 




ID 


ATA 

INU 


1 U4 ) 


ATATAA ATA TV 


/ gtta 


tpv. 
ID 


at a 

JNU 


/Ub j 


prnnri AT a A AT 


/ GT7A 


ID 


ATA 

INU 


7HQ \ 

/ U o ; 


I AAU 1 Ab 


v oJ^y 


TPi 
ID 


ATA 
JNU 


/ XU J 










71 0 \ 


AACTAGGGAA 


f 53F10 


TO 

-L, JLV 


NO 


714 ) 


CTAGGGAACC 


(SEO 


ID 


NO 


716 ) 








NO 


flO/ 


AA7A IPPPAPT 
VjLj AAL U L AU 1 




TP) 
ID 


ATA 

INU 


1 ZKJ } 


AACCCACTGC 


(SEQ 


ID 


NO 


722 ) 


CCCACTGCTT 


(SEQ 


ID 


NO 


724 ) 


CACTGCTTAA 


(SEQ 


ID 


NO: 


726) 


CTGCTTAAGC 


(SEQ 


ID 


NO : 


728) 


GCTTAAGCCT 


(SEQ 


ID 


NO : 


73 0 ) 


TTAAGCCTCA 


(SEQ 


ID 


NO : 


732 ) 


A A rTPTPTV AT" 


f CPA 


T Pv 
ID 


VTA . 

INU : 


/i4) 


A A AT A A A T 7\ A 

uLL 1 LAA1AA 


(SEQ 


T P» 

ID 


ATA • 

JNU : 


/JD / 


AT A A ATA A A A 

U 1 LAAI AAAb 


(SEQ 


ID 


ATA . 

NO : 


T O O \ 

/jo) 


CAATAAAGCT 


(SEQ 


ID 


NO : 


74 0 ) 


ATAAAGCTTG 


(SEQ 


ID 


NO : 


742 ) 


AAAGCTTGCC 


(SEQ 


ID 


NO : 


744) 


AGCTTGCCTT 


(SEQ 


ID 


NO: 


746) 


CTTGCCTTGA 


(SEQ 


ID 


NO- 


748) 


TGCCTTGAGT 


(SEQ 


ID 


NO 


750) 


CCTTGAGTGC 


(SEQ 


ID 


NO 


752 ) 


TTGAGTGCTT 


(SEQ 


ID 


NO 


754 ) 


A A ATA ATT A A 


( opa 


ID 


ATA 

JNU 


1 c c \ 
Zoo) 


pita attata a p 

X UL X X UUib 


/ otto 


TPi 


ATA 


too) 


a attaa aptr 


/ CPA 


TFi 
ID 


JNU 


1 D U ) 


1 X UAACj 1 AL 1 


/ OPA 


ID 


ATA 
INU 


/ OA ) 


CAAGTAGTGT 


(SEQ 


ID 


NO 


764) 


AGTAGTGTGT 


(SEQ 


ID 


NO 


766) 


TAGTGTGTGC 


(SEQ 


ID 


NO 


768) 


GTGTGTGCCC 


(SEQ 


ID 


NO 


:770) 


GTGTGCCCGT 


(SEQ 


ID 


NO 


:772) 


GTGCCCGTCT 


(SEQ 


ID 


NO 


: 774) 
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GAGCTCTCTG 


(SEQ 


ID 


NO 


. *"7 r\ n \ 
' /O 1 ) 


GCTCTCTGGC 




T 

ID 


ATA 
JNU 


H Pi "5 \ 


ILILl X A 


/ Ct?A 


T Pi 

ID 


ATA 


7nc;^ 




/ Q TP A 


TP) 
ID 


IN w 


/ U / } 


1 tjf^V- 1 AAL X A 




TP) 


ATA 


/ U zf } 


X AAL X ALjLi 


/ CPA 


TV) 

X L/ 


MO - 


71 1 \ 


X rlriv^ X rt.\jvjvjrt 






NO 


71 ^ ) 


ACTAGGGAAC 


(SEQ 


ID 


NO 


715 ) 


T AGGG AA P C C 


(SEO 


ID 


NO : 


717) 






TD 


MO * 

1NVJ 


71 9) 


n7\r ppp7\ ATA 




TPt 
ID 


ATA » 


/ Z± ) 


ACCCACTGCT 


/ O 7—) /"N 

(SEQ 


ID 


NO : 


»"7 /-v \ 

723 ) 


CCACTGCTTA 


(SEQ 


ID 


NO : 


725) 


ACTGCTTAAG 


(SEQ 


ID 


NO: 


727) 


TGCTTAAGCC 


(SEQ 


ID 


NO : 


729) 


CTTAAGCCTC 


(SEQ 


ID 


NO : 


731) 


TAAGCCTCAA 


V SEQ 


T T*V 

ID 


NO : 


733 ; 


A AAATAA ATA 


/ C T? A 


T Pi 

ID 


ATA . 


T3C\ 

/Jo/ 


nnnipl\ A T* A A A 
v-U X LAAI AAA 


/ CPA 


T Pi 

ID 


ATA - 


*7 *5 "7 \ 
16 1) 


rnriTi ATA A A OA 

T. LAAI AAAQjC 


(SEQ 


TP\ 

ID 


ATA . 

NO : 


/ j y ) 


tv tv m tv tv tv /"i rn m 

AATAAAGCTT 


(SEQ 


ID 


NO : 


741 ) 


TAAAGCTTGC 


(SEQ 


ID 


NO : 


743 ) 


AAGCTTGCCT 


(SEQ 


ID 


NO: 


745) 


GCTTGCCTTG 


(SEQ 


ID 


NO: 


747) 


TTGCCTTGAG 


(SEQ 


ID 


NO: 


749) 


GCCTTGAGTG 


(SEQ 


ID 


NO: 


751) 


CTTGAGTGCT 


(SEQ 


ID 


NO 


753 ) 


TGAGTGCTTC 


(SEQ 


ID 


NO 


755) 


A ATA ATT A A A 
A\j X LjL X X LAA 


( q r? a 


TPl 


MA 


H C *7 \ 


X v_3Vw* X X v-iiiivj X 


\ <^ Xj\^ 


TPl 

X. XJ 


1st w 


7C Q \ 


ffpfpp7\ 7\ ATAA 
L 1 1 LAA^jr X AVj 




TP\ 
ID 


ATA 

JNU 


id \ 
/ 0 L) 


rp a A A AT A ATA 
1 LAAb 1 Atjf 1 


/ CPA 


TPt 

ID 


ATA 

NU 


/6 J ) 


AAGTAGTGTG 


(SEQ 


ID 


NO 


:765) 


GTAGTGTGTG 


(SEQ 


ID 


NO 


:767) 


AGTGTGTGCC 


(SEQ 


ID 


NO 


:769) 


TGTGTGCCCG 


(SEQ 


ID 


NO 


:771) 


TGTGCCCGTC 


(SEQ 


ID 


NO 


:773) 


TGCCCGTCTG 


(SEQ 


ID 


NO 


:775) 
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GCCCGTCTGT 


(SEQ 


ID 


NO : 


776) ; 


CCCGTCTGTT 


(SEQ 


ID 


NO: 


777) ; 


CCGTCTGTTG 


(SEQ 


ID 


NO : 


778) ; 


CGTCTGTTGT 


(SEQ 


ID 


NO: 


779) ; 


GTCTGTTGTG 


(SEQ 


ID 


NO : 


780) ; 


TCTGTTGTGT 


(SEQ 


ID 


NO: 


781) ; 


CTGTTGTGTG 


(SEQ 


ID 


NO: 


782) ; 


TGTTGTGTGA 


(SEQ 


ID 


NO: 


783) ; 


GTTGTGTGAC 


(SEQ 


ID 


NO: 


784) ; 


TTGTGTGACT 


(SEQ 


ID 


NO: 


785) ; 


TGTGTGACTC 


(SEQ 


ID 


NO: 


786) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 


78 7) ; 


TGTGTGACTC 


(SEQ 


ID 


NO: 


788) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 


789) ; 


TGTGACTCTG 


(SEQ 


ID 


NO : 


790) ; 


GTGACTCTGG 


(SEQ 


ID 


NO: 


791) ; 


TGACTCTGGT 


(SEQ 


ID 


NO: 


792) ; 


GACTCTGGTA 


(SEQ 


ID 


NO: 


793) ; 


ACTCTGGTAA 


(SEQ 


ID 


NO: 


794) ; 


CTCTGGTAAC 


(SEQ 


ID 


NO: 


795) ; 


TCTGGTAACT 


(SEQ 


ID 


NO: 


796) ; 


CTGGTAACTA 


(SEQ 


ID 


NO: 


797) ; 


TGGTAACTAG 


(SEQ 


ID 


NO: 


798) ; 


GGTAACTAGA 


(SEQ 


ID 


NO: 


799) . 



15. A strain of HIV-1 according to claim 8 having the identifying characteristics of 
HIV-1 isolate ECACC V94101706 or HIV-1 isolate ECACC V94103 1 169 or isolate C54 
deposited at ECACC on 14 February, 1995 under Accession Number . 

16. An isolated strain of HIV-1 which is reactive to antibodies to a glycoprotein of 
HIV- 1 , is capable of inducing an immune response to at least one of gag, pol and/or env 
and which is incapable of directing synthesis of a nef gene product or a full length nef 
gene product. 

17. An isolated strain of HIV-1 according to claim 16 wherein the glycoprotein is 
at least one of gp41-45, gpl20 and/or gpl60. 

18. A nucleic acid molecule or a part, fragment or derivative thereof from an HIV-1 
isolate defined in any one of claims 1 to 16. 

19. A molecular infectious clone comprising a nucleic acid molecule according to 
claim 18. 

20. A method for inhibiting or reducing productive infection of an individual by a 
pathogenic strain of HIV-1, said method comprising administering to a subject a non- 
pathogenic isolate of HIV-1 in an amount effective to infect target cells and to generate 
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target cells carrying DNA derived from said non-pathogenic HIV-1. 

21. A method according to claim 20 wherein said isolate is capable of stimulating 
an immune response in a human or primate against at least one glycoprotein on HIV-1 
whilst not substantially reducing proliferative responses and cytokine production to a 
mitogen in said human or primate compared to a healthy, non-infected human or primate 
subject. 

22. A method according to claim 21 wherein said isolate carries one or more 
mutations in its genome resulting in the inability to direct synthesis of at least one 
polypeptide or protein from a pathogenic strain of HIV-1. 

23. A method according to claim 17 wherein said isolate carries one or more 
mutations in its genome resulting in said genome directing synthesis of a truncated form ^ 
of a polypeptide or protein from a pathogenic strain of HIV-1. 

24. A method according to claim 22 or 23 wherein said HIV-1 isolate carries a ; ... ^ 
mutation in its nef gene and/or long terminal repeat (LTR) region or in a functionally 
equivalent location in the HIV-1 genome. 

25. A method according to claim 24 wherein said HIV-1 isolate is reactive to ^ ^ 
antibodies to gp41-45, gpl20 and/or gpl60 of HIV-1 and carries a deletion of at least ^ 
ten contiguous nucleotides in a region corresponding to the nef gene and/or LTR region 

of pathogenic HIV-1. 

26. A method according to claim 25 wherein said HIV-1 isolate is capable of 
inducing an immune response to at least one of gag, pol and/or env. 

27. A method according to claim 26 wherein said HIV-1 isolate carries a deletion in 
its genome of at least 10 nucleotides within the region from nucleotide 8787 to 
nucleotide 9709 using the nucleotide numbering of HIV-1 strain NL4-3. 
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28. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (i) 8830-8862; 

(ii) 9009-9035; 

(iii) 9019-9029; and 

(iv) 9033-9049. 

29. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (v) 9281-9371; 

(vi) 9281-9362; 

(vii) 9105-9224; and 

(viii) 9271-9370. 

30. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 

31. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

32. A method according to claim 27 wherein said deletion encompasses at least one 
of the following decanucleotides from the nef gene of HIV-1 strain NL43: 
ATGGGTGGCA (SEQ ID NO: 2); TGGGTGGCAA (SEQ ID NO : 3 ) ; 
GGGTGGCAAG (SEQ ID NO:4); GGTGGCAAGT (SEQ ID NO: 5); 
GTGGCAAGTG (SEQ ID NO: 6); TGGCAAGTGG (SEQ ID NO:7); 
GGCAAGTGGT (SEQ ID NO: 8) ; GCAAGTGGTC (SEQ ID NO: 9) ; 
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PAAGTGGTCA 


(SEO 


ID 


NO : 


10) <; 


ZVfVTGGTGAAA 


(SEO 


ID 


NO: 


12) ; 


1 VjrvJf J- \— rtj^rt-TTuri. 


( SEO 
v ° ^ V 


ID 


NO : 


14) ; 


GTGAAAAAGT 


(SEO 


ID 


NO : 


16) ; 


GAAAAAGTAG 


(SEO 


ID 


NO : 


18) ; 


AAAAGTAGTG 


(SEO 


ID 


NO : 


20) ; 


AAGTAGTGTG 


(SEQ 


ID 


NO: 


22) ; 


GTAGTGTGAT 


(SEQ 


ID 


NO: 


24) ; 


AGTGTGATTG 


(SEQ 


ID 


NO: 


26) ; 


TGTGATTGGA 


(SEQ 


ID 


NO: 


28) ; 


TGATTGGATG 


(SEQ 


ID 


NO: 


30) ; 


ATTGGATGGC 


(SEQ 


ID 


NO : 


32) ; 


tggatggggt 




TD 


NO : 


34) ; 






TD 


NO • 


36 ) ; 




(SEO 


ID 


NO : 


38) ; 


GCCTGGTGTA 


(SEO 


ID 


NO : 


40) ; 


CTGCTGTAAG 


(SEQ 


ID 


NO: 


42) ; 


GCTGTAAGGG 


(SEO 


ID 


NO : 


44) ; 


TGTAAGGGAA 


(SEO 


ID 


NO: 


46) ; 


TA AGGGA A AG 




TD 


NO : 


48) ; 






TD 
x u 




^0) - 
~j \j i i 






Tn 
X u 




^9 ) * 


AAAvjAA J. bAo 




Tn 










JL U 


MP) 








x U 




. JO J , 






th 

A. i-J 




• n ^ 






Tn 










ID 


NO 


: 64 ) 


G AG G TG AG G C 


(SEO 


ID 


NO 


: 66) 


GGTG AGG GAG 


(SEO 


ID 


NO 


: 68) 


TGAGCGAGCA 


(SEO 


ID 


NO 


: 70) 


AGGGAGGAGG 


(SEO 


ID 


NO 


: 72) 


CCAGCAGCAG 


(SEQ 


ID 


MU 


. *"7 /! ^ 


AGCAGCAGAT 


(SEQ 


ID 


NO 


:76) 


CAGCAGATGG 


(SEQ 


ID 


NO 


: 78) 


GCAGATGGGG 


(SEQ 


ID 


NO 


:80) 


AGATGGGGTG 


(SEQ 


ID 


NO 


: 82) 


ATGGGGTGGG 


(SEQ 


ID 


NO 


: 84) 



AAGTGGTCAA 

GTGGTCAAAA 

GGTCAAAAAG 

TCAAAAAGTA 

AAAAAGTAGT 

AAAGTAGTGT 

AGTAGTGTGA 

TAGTGTGATT 

GTGTGATTGG 

GTGATTGGAT 

GATTGGATGG 

TTGGATGGCC 

GGATGGCCTG 

ATGGCCTGCT 

GGCCTGCTGT 

CCTGCTGTAA 

TGCTGTAAGG 

CTGTAAGGGA 

GTAAGGGAAA 

AAGGGAAAGA 

GGGAAAGAAT 

GAAAGAATGA 

AAGAATGAGA 

GAATGAGACG 

ATGAGACGAG 

GAGACGAGCT 

GACGAGCTGA 

CGAGCTGAGC 

AGCTGAGCCA 

CTGAGCCAGC 

GAGCCAGCAG 

GCCAGCAGCA 

CAGCAGCAGA 

GCAGCAGATG 

AGCAGATGGG 

CAGATGGGGT 

GATGGGGTGG 

TGGGGTGGGA 



(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 
(SEQ ID NO 

(SEQ ID 

(SEQ ID 
(SEQ ID NO 

(SEQ 

(SEQ 

(SEQ 

(SEQ 

(SEQ 

(SEQ 

(SEQ 

(SEQ 

(SEQ 

(SEQ 

(SEQ 

(SEQ 

(SEQ 

(SEQ 



(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 



ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 



(SEQ ID 
(SEQ ID 



ID 

ID 

ID 

ID 

ID 

ID 

ID 

ID 

ID 

ID 

ID 

ID 

ID 

ID 



NO: 11) , 
NO: 13) , 
NO: 15) , 
NO: 17) , 

:19) ; 
NO: 21) , 
NO: 23) , 
:25) ; 
NO: 27) 
NO:29) 
NO:31) 
NO: 33) 
NO : 3 5 ) 
NO:37) 
NO:39) 
NO : 4 1 ) 
NO: 43) 
NO: 45) 
NO: 47) 
NO:49) 
NO: 51) 
NO: 53) 
NO : 55 ) 
NO: 57) 
NO: 59) 
NO:61) 
NO: 63) 
NO: 65) 
NO: 67) 
NO:69) 
NO: 71) 
NO: 73) 
NO: 75) 
NO: 77) 
NO: 79) 
NO: 81) 
NO: 83) 
NO: 85) 
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GGGGTGGGAG (SEQ ID NO: 86); 
GGTGGGAGCA (SEQ ID NO: 88); 
TGGGAGCAGT ( SEQ ID NO : 9 0 ) ; 
GGAGCAGTAT (SEQ ID NO:92); 

AGCAGTATCT (SEQ ID NO: 94); 

CAGTATCTCG (SEQ ID NO: 96); 

GTATCTCGAG (SEQ ID NO: 98); 

ATCTCGAGAC ( SEQ ID NO : 1 0 0 ) ; 

CTCGAGACCT (SEQ ID NO: 102); 

CGAGACCTAG (SEQ ID NO: 104); 

AGACCTAGAA ( SEQ ID NO : 1 0 6 ) ; 

ACCTAGAAAA (SEQ ID NO:108); 

CTAGAAAAAC (SEQ ID NO:110); 

AGAAAAACAT (SEQ ID NO: 112); 

AAAAACATGG (SEQ ID N0:114); 

AAACATGGAG (SEQ ID NO: 116); 

ACATGGAGCA ( SEQ ID NO : 1 1 8 ) ; 

ATGGAGCAAT (SEQ ID NO: 120); 

GGAGCAATCA (SEQ ID NO: 122); 

AGCAATCACA (SEQ ID NO: 124); 

CAATCACAAG (SEQ ID NO: 126); 

AT CACAAGTA (SEQ ID NO: 128); 

CACAAGTAGC (SEQ ID NO: 130); 

CAAGTAG CAA (SEQ ID NO:132); 
AGTAGCAATA (SEQ ID NO: 134) ; 
TAGCAATACA (SEQ ID NO: 136); 
GCAATACAGC (SEQ ID NO: 138); 
AATACAGCAG (SEQ ID NO: 140); 
TACAGCAGCT (SEQ ID NO: 142); 
CAGCAGCTAA (SEQ ID NO:144); 
GCAGCTAACA (SEQ ID NO:146); 
AGCTAACAAT (SEQ ID NO: 148) ; 
CTAACAATGC (SEQ ID NO: 150); 
AACAATGCTG (SEQ ID NO:152); 
CAATGCTGCT (SEQ ID NO: 154); 
ATGCTGCTTG (SEQ ID NO: 156); 
GCTGCTTGTG (SEQ ID NO: 158); 
TGCTTGTGCC (SEQ ID NO: 160); 



GGGTGGGAGC 


(SEQ 


ID 


NO : 8 7 ) 


GTGGGAGCAG 


(SEQ 


ID 


NO : 8 9 ) 


GGGAGCAGTA 


(SEQ 


ID 


NO: 91) 


GAGCAGTATC 


(SEQ 


ID 


NO: 93) 


GCAGTATCTC 


(SEQ 


ID 


NO: 95) 


AGTATCTCGA 


(SEQ 


ID 


NO:97) 


TATCTCGAGA 


(SEQ 


ID 


NO: 99) 


TCTCGAGACC 


(SEQ ID NO:101) 


TCGAGACCTA 






MO 


: xu j ) ; 


GAGACCTAGA 


(SbQ 


ID 


JNU 


: 1 U b ; ; 


GACCTAGAAA 


(SEQ 


ID 


NO 


: 1 0 7 ) ; 


CCTAGAAAAA 


(SEQ 


ID 


NO 


: 1 0 9 ) ; 


TAGAAAAACA 


(SEQ 


ID 


NO 


111) ; 


GAAAAACATG 


(SEQ 


ID 


NO 


113 ) ; 


AAAACATGGA 


(SEQ 


ID 


NO 


115) ; 


AACATGGAGC 


(SEQ 


ID 


NO. 


117) ; 


CATGGAGCAA 


(SEQ 


ID 


NO, 


119) ; 


TGGAGCAATC 


(SEQ 


ID 


NO : 


121) ; 


GAGCAATCAC 


(SEQ 


ID 


NO : 


123) ; 


GCAATCACAA 


(SEQ 


ID 


NO : 


125) ; 


AATCACAAGT 


(SEQ 


ID 


NO : 


12 7) ; 


T CACAAGT AG 


(SEQ 


ID 


NO : 


12 9) ; 


ACAAGTAGCA 


( bay 


XL) 


rju : 


ui) ; 


AAGTAGCAAT 


( CPA 


TTi 


fr[r\ . 
JNU : 


1 J 3 ) ; 


GTAGCAATAC 




T pi 


rJvJ : 


x j o ) ; 


AGCAATACAG 


( CPA 

v any 


XL) 


vrr\ . 

r*\J : 


1 *3 *"7 \ - 


CAATACAGCA 


(SEQ 


ID 


NO : 


13 9) ; 


ATACAGCAGC 


(SEQ 


ID 


NO: 


141) ; 


ACAGCAGCTA 


(SEQ 


ID 


NO: 


143) ; 


AGCAGCTAAC 


(SEQ 


ID 


NO: 


145) ; 


CAGCTAACAA 


(SEQ 


ID 


NO: 


147) ; 


GCTAACAATG 


(SEQ 


ID 


NO: 


14 9) ; 


TAACAATGCT 


(SEQ 


ID 


NO: 


151) ; 


ACAATGCTGC 


(SEQ 


ID 


NO: 


153) ; 


AATGCTGCTT 


(SEQ 


ID 


NO: 


155) ; 


TGCTGCTTGT 


(SEQ 


ID 


NO: 


157) ; 


CTGCTTGTGC 


(SEQ 


ID 


NO: 


159) ; 


GCTTGTGCCT 


(SEQ 


ID 


NO: 


161) ; 
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C 1 1 G I GGG 1 G 


/ cup 


TD 
X U 


MP) - 


X O ^ J / 


xG 1 GGG I GGG 




TFl 




X O / , 


1 GGG 1 GGG JL A 


/ CI? Pi 




MP - 

XNW . 




LL 1 GGG 1 AGA 


/ OT?P 


X JJ 


MP • 


IOCS j , 


1 GGG 1 AGAAG 


/ CUP\ 


X U 


TSTP * 
V4\J • 


X / U ) f 


GG 1 AGAA(jl_A 




X U 


TJP • 


X I £ ) f 


TAP A ^nfAfA 




TT\ 

X LJ 




1 74 ) * 

X / rt / f 


APPAPA AH 


(SEP 


ID 

J. LJ 


NO : 


176 ) ; 


APPAPA AO AO 


( ^EO 


ID 

X xj 


NO : 


178 ) ; 


p a p a a ci a pp A 




TD 




i an) * 

X O v / / 


GAAGAGGAGG 




XU 


1STP • 
JNU t 


X O £* ) t 


AGAGGAGGAA 


(SEQ 


ID 


NO : 


184) ; 


AGGAGGAAGA 


(SEQ 


ID 


NO : 


186) ; 


GAGGAAGAGG 


(SEQ 


ID 


NO: 


18 8) ; 


GGAAGAGGTG 


(SEQ 


ID 


NO : 


19 0) ; 


AAGAGGTGGG 


f CT?P\ 

(SEQ 


ID 


: 


ly^j ; 


GAGG 1 GGG 1 I 


/ o tt» r~\ 

(SEQ 


XU 


inu : 


1j4 J ; 


GG 1 GGG 1111 


(SEQ 


ID 


JnU : 


X y b ) ; 


TGGGTTTTCC 


(SEQ 


ID 


NO : 


1 Q Q \ . 

1 y o ) ; 


GGTTTTCCAG 


(SEQ 


ID 


NO : 


o /*\ /*\ \ 

200) ; 


TTTTCCAGTC 


(SEQ 


ID 


NO : 


2 02) ; 


TTCCAGTCAC 


(SEQ 


ID 


NO : 


2 04 ) ; 


CCAGTCACAC 


(SEQ 


ID 


NO : 


2 06) ; 


AGTCACACCT 


(SEQ 


ID 


NO : 


208) ; 


TCACACCTCA 


(SEQ 


ID 


NO: 


210) ; 


ACACCTCAGG 


(SEQ 


ID 


NO : 


212) ; 


ACCTCAGGTA 


(SEQ 


ID 


NO: 


214) ; 


CTCAGGTACC 


(SEQ 


ID 


NO : 


216) ; 


GAGG1AGG1 1 


/ OTTO 


XU 


ViKJ . 








TT) 

X LJ 


LH\J . 


4* \j i t 


TAPPTTTA AP 




TT) 
11/ 


MP 
liv 




pptttaaprp 
LL1 1 1AAGAG 


A CPA 


XU 


IMP 




TTTAAGACCA 


(SEQ 


ID 


NO 


: 226) ; 


TAAGACCAAT 


(SEQ 


ID 


NO 


:228) ; 


AGACCAATGA 


(SEQ 


ID 


NO 


:230) ; 


ACCAATGACT 


(SEQ 


ID 


NO 


:232) ; 


CAATGACTTA 


(SEQ 


ID 


NO 


:234) ; 


ATGACTTACA 


.(SEQ 


ID 


NO 


:236) ; 
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TTHTGCCTGG 


(SEO 


ID 


NO: 


163) ; 


PTGPCTGGCT 


(SEQ 


ID 


NO : 


165) ; 


GCCTGGCTAG 


(SEQ 


ID 


NO: 


16 7) ; 


CTGGCTAGAA 


(SEQ 


ID 


NO: 


16 9) ; 


OC4PTAGAAGC 


(SEQ 


ID 


NO: 


171) ; 


CTAGAAGCAC 


(SEQ 


ID 


NO: 


173) ; 


AGAAGCACAA 


(SEQ 


ID 


NO: 


175) ; 


AAGCACAAGA 


(SEQ 


ID 


NO: 


177) ; 


GCACAAGAGG 


(SEQ 


ID 


NO: 


179) ; 


ACAAGAGGAG 


(SEQ 


ID 


NO: 


181) ; 


AAGAGGAGGA 


(SEQ 


ID 


NO: 


183) ; 


PAPPAHPA AP 




ID 


NO : 


185) ; 


GG AG bAHbHb 




TD 

X LJ 


NO * 

LH \J » 


187) ; 


APT*7V AP7\PPT 

AGGAAGAGG 1 


/ QT7P 


TD 

X LJ 


NO • 


X \J / / 


PA APAPPTPP 


(SEO 


ID 


NO : 


191) ; 


APAPPTGGGT 


(SEO 


ID 


NO : 


193) ; 


APPTPPPTTT 


(SEO 


ID 


NO: 


195) ; 


VJ X OVJw X X X X 


(SEO 


ID 


NO : 


197) ; 






ID 

X X-f 


NO : 


199) ; 


Gl 1 1 1 LUibi 




TD 

X i-J 


NO - 


\J A. ) f 


111 GGAG 1 GA 


^ QT?P 


xu 


MP) . 


\j j y / 


1 GGAG 1 GAGA 




XXJ 


NO • 

XMW . 


£*\J Zj } § 


GAG 1 GAG AG G 


( QT7P 


X LJ 


TJO • 


0C\1) * 

£t\J 1 } f 


GTCACACCTG 


/ C UP\ 


Tn 
xu 


MP - 




CACACCTCAG 


(SEQ 


ID 


NO : 


zll) ; 


CACCTCAGGT 


(SEQ 


XU 


Inu : 


^ij / ; 


CCTCAGG1 AG 




Tn 

X D 


MP 
IN KJ 


<il j J / 


X X jH-^-U, X 




TD 

X LJ 


NO 


217) ; 


AGGTACCTTT 


(SEQ 


ID 


NO 


.219) ; 


GTACCTTTAA 


(SEQ 


ID 


NO 


:221) ; 


ACCTTTAAGA 


(SEQ 


ID 


NO 


:223) ; 


CTTTAAGACC 


(SEQ 


ID 


NO 


:225) ; 


TTAAGACCAA 


(SEQ 


ID 


JNU 


: 22. 1 ) ; 


AAGACCAATG 


(SEQ 


ID 


NO 


:229) ; 


GACCAATGAC 


(SEQ 


ID 


NO 


:231) ; 


C CAATG ACTT 


(SEQ 


ID 


NO 


:233) ; 


AATGACTTAC 


(SEQ 


ID 


NO 


:235) ; 


TGACTTACAA 


(SEQ 


ID 


NO 


:237) ; 
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GACTTACAAG ( SEQ ID NO : 2 3 8 ) ; 
CTTACAAGGC (SEQ ID NO: 240); 
TACAAGGCAG (SEQ ID NO: 242); 
CAAGGCAGCT (SEQ ID NO:244); 
AGGCAGCTGT (SEQ ID NO: 246); 
GCAGCTGTAG (SEQ ID NO: 248); 
AGCTGTAGAT (SEQ ID NO: 2 50); 
CTGTAGATCT (SEQ ID NO: 252); 
GTAGATCTTA (SEQ ID NO:254); 
AGATCTTAGC (SEQ ID NO: 2 56); 
ATCTTAGCCA (SEQ ID NO: 2 58); 
CTTAGCCACT (SEQ ID NO:260); 
TAGCCACTTT (SEQ ID NO: 2 62); 
GCCACTTTTT (SEQ ID NO: 264); 
CACTTTTTAA (SEQ ID NO: 2 66); 
CTTTTTAAAA (SEQ ID NO:268); 
TTTTAAAAGA (SEQ ID NO: 270); 
TTAAAAGAAA (SEQ ID NO: 2 72); 
AAAAGAAAAG (SEQ ID NO:274); 
AAGAAAAGGG (SEQ ID NO: 2 76); 
GAAAAGGGGG (SEQ ID NO: 278); 
AAAGGGGGGA (SEQ ID NO:280); 
AGGGGGGACT (SEQ ID NO: 282); 
GGGGGACTGG (SEQ ID NO: 2 84); 
GGGACTGGAA ( SEQ ID NO : 2 8 6 ) ; 
GACTGGAAGG (SEQ ID NO: 288); 
CTGGAAGGGC (SEQ ID NO: 2 90); 
GGAAGGGCTA (SEQ ID NO:292); 
AAGGGCTAAT (SEQ ID NO: 294); 
GGGCTAATTC (SEQ ID NO:296); 
GCTAATTCAC (SEQ ID NO: 2 98); 
TAATTCACTC (SEQ ID NO: 300); 
ATTCACTCCC (SEQ ID NO:302); 
TCACTCCCAA (SEQ ID NO:304); 
ACTCCCAAAG (SEQ ID NO: 3 06); 
TCCCAAAGAA (SEQ ID NO:308); 
CCAAAGAAGA (SEQ ID NO : 3 1 0 ) ; 
AAAGAAGACA ( SEQ ID NO : 3 12 ) ; 
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ACTTACAAGG 


(SEQ 


ID 


NO 


: 239) 


T T ACAAGG CA 


(SEQ 


ID 


NO 


: 241) 


ACAAGGCAGC 


(SEQ 


ID 


NO 


:243) 


AAGGCAGCTG 


(SEQ 


ID 


NO 


:245) 


GGCAGCTGTA 


(SEQ 


ID 


NO 


: 24 7) 


CAGCTGTAGA 


(SEQ 


ID 


NO 


: 249) 


\3\~ 1 vj i ACjrA I C 


(SEQ 


ID 


NO 


: 251) 


1 1 AGATCTT 


(SEQ 


ID 


NO 


: 253 ) 


T AGATCTT AG 


(SEQ 


ID 


NO 


: 255) 


GATCTTAGCC 


(SEQ 


ID 


NO 


:257) 


TCTTAGCCAC 


(SEQ 


ID 


NO 


:259) 


TTAGCCACTT 


(SEQ 


ID 


NO 


: 261) 


AGCCACTTTT 


(SEQ 


ID 


NO 


:263) 


CCACTTTTTA 


(SEQ 


ID 


NO 


:265) 


ACTTTTTAAA 


(SEQ 


ID 


NO 


:267) 


TTTTTAAAAG 


(SEQ 


ID 


NO 


:269) 


TTTAAAAGAA 


(SEQ 


ID 


NO: 


271) 


TAAAAGAAAA 


(SEQ 


ID 


NO: 


273) 


AAAG AAAAGG 


(SEQ 


ID 


NO: 


275) 


AGAAAAGGGG 


(SEQ 


ID 


NO: 


277) 


AAAAGGGGGG 


(SEQ 


ID 


NO: 


279) 


TV TV /*T /™1 /~ 1 /^t /"T TV j^i 

AAGGGGGGAC 


(SEQ 


ID 


NO: 


281) 


GGGGGGACTG 


(SEQ 


ID 


NO : 


283 ) 


GGGGACTGGA 


(SEQ 


ID 


NO: 


285) 


/T/*T TV r^T y^i TV TV 

GGACTGGAAG 


(SEQ 


ID 


NO: 


287) 


ACTGGAAGGG 


(SEQ 


ID 


NO: 


289) 


TGGAAGGGCT 


(SEQ 


ID 


NO: 


291) 


/*! TV TV /T i^T >"im TV TV 

GAAGGGCTAA 


(SEQ 


ID 


NO: 


293) 


AGGGCTAATT 


(SEQ 


ID 


NO: 


295) 


GGCTAATTCA 


(SEQ 


ID 


NO: 


297) , 


CTAATT CACT 


(SEQ 


ID 


NO: 


299) , 


AATTCACTCC 


(SEQ 


ID 


NO : 


301) , 


TTCACTCCCA 


(SEQ 


ID 


NO: 


303) , 


pj\ prn/-»p/-i> TV TV 

UVL I CCCAAA 


(SEQ 


ID 


NO: 


305) , 


CTCCCAAAGA 


(SEQ 


ID 


NO: 


307) , 


CCCAAAGAAG 


(SEQ 


ID 


NO: 


309) , 


CAAAGAAGAG 


(SEQ 


ID 


NO: 


311) ; 


AAGAAGACAA 


(SEQ 


ID 


NO: 


313) ; 
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AoAA^A^-rt-rl^ 




TT) 

XU 


"MO • 


114) • 


GAAGACAAGA 


(SEQ 


ID 


NO: 


315) ; 






TT) 
X u 




O X O / f 


AGACAAGATA 


(SEQ 


ID 


NO: 


317) ; 


o A O A A ^ A r P A H" 1 

(jAC AA<J A 1 i-i 1 




X. u 


Kin • 


O X O / / 


AC AAGAT AT C 


(SEQ 


ID 


NO : 


319) ; 






TD 


vrp, . 
inw . 


7 c> n ^ • 


A AGATAT C CT 

rvnvjn x x ^ — v — • x 


(SEO 


ID 


NO : 


321) ; 






TT) 

X U 




JL* *-J / ( 


GATATCCTTG 


(SEQ 


ID 


NO: 


323) ; 


ATATPPTTnA 




TT) 




124 ) • 
*j ~ / / 


TATCCTTGAT 


(SEQ 


ID 


NO: 


325) ; 


ATPPTTGATC 


(SEO 


ID 


NO : 


326) ; 


TCCTTGATCT 


(SEQ 


ID 


NO: 


327) ; 


CCTTGATCTG 


(SEO 


ID 


NO: 


328) ; 


CTTGATCTGT 


(SEQ 


ID 


NO: 


329) ; 


TTGATCTGTG 


(SEQ 


ID 


NO: 


330) ; 


TGATCTGTGG 


(SEQ 


ID 


NO: 


331) ; 


GATCTGTGGA 


(SEQ 


ID 


NO: 


332) ; 


ATCTGTGGAT 


(SEQ 


ID 


NO: 


333) ; 


TCTGTGGATC 


(SEQ 


ID 


NO: 


334) ; 


CTGTGGATCT 


(SEQ 


ID 


NO: 


335) ; 


TGTGG AT CTA 


(SEO 


ID 


NO : 


33 6) ; 


GTGGATCTAC 


(SEQ 


ID 


NO: 


337) / 


TPnATfTA pp 




TT) 

XU 


NO • 

XNW • 


j j 0 / /■ 


GGATCTACCA 


(SEQ 


ID 


NO: 


339) ; 


bAlLlALLAL 




ID 


wn * 
INU . 




ATfTAPPAPA 


(SEO 


ID 


NO : 


341) ; 


*T*/**"T , A /"'/"'A P" 1 A r~* 


/ o cp^ 
(SEQ 


rn 

XU 


JNU : 


*3 A O ^ • 


PTAPPAPAPA 


(SEO 


ID 


NO : 


343) ; 


T AC CAUAuAL 


(bh*y 


X U 


ViKJ I 




A PP APA P A PA 


(SEO 


ID 


NO : 


345) ; 


L. LALALALAA 


/ OTTO 


XL) 




O *x D J f 


PAPAPACAAG 


(SEO 


ID 


NO : 


34 7) ; 


a P" 1 a r" 1 a O A A P^P" 1 




XU 


ran • 

IN w * 


0 *± 0 ) f 


PAPACAAGGC 


(SEO 


ID 


NO: 


349) ; 


A P" 1 A P" 1 A A PPPT 
ALALAAbbL 1 




jn 
XU 


IN . 




P A P AAGG C T A 


(SEO 


ID 


NO: 


3 51) ; 


ACAAGGCTAC 


(SEQ 


XL) 


TJP"1 




PAAPPPTAPT 


(SEO 


ID 


NO : 


3 53) ; 


AAGGCTACTT 


(SEQ 


ID 


NO 


3 54) ; 


AIjVjVw 1 At lit 


/ qpn 
\ Dny 


Tn 

XU 




-J — > — ' y / 


GGCTACTTCC 


(SEQ 


ID 


NO 


3 56) ; 


x AC 1 1 LLL 




XU 


MP) - 


jo/; / 


CTACTTCCCT 


(SEQ 


ID 


NO 


3 58) ; 






*rr\ 

XU 


MP) . 


■3 -3 J / / 


ACTTCCCTGA 


(SEQ 


ID 


AT/"*\ 

NO 


: 3 o 0 j ; 






TT) 
x u 


NO 


-3 O X / / 


TTCCCTGATT 


(SEQ 


T TA 
XL) 




:obzJ ; 


TPPPTPATTP 


( SEO 


ID 


NO 


* 3 6 3 ) ; 


L.L-L.xLxtt.1 IvjLj 




if 


MO 


• 1 64 > * 


CCTGATTGGC 


(SEQ 


ID 


NO 


:365) ; 


L X \JX\ X X vJOv«.r^ 


( SEO 

V ^ J— 1 w 


ID 


NO 


: 3 6 6 ) ; 


TGATTGGCAG 


(SEQ 


ID 


NO 


:367) ; 


GATTOGPAGA 


(SEO 


ID 


NO 


: 3 6 8 ) ; 


ATTGGCAGAA 


(SEQ 


ID 


NO 


:369) ; 


ttpppaoaap 


f SEO 


ID 


NO 


:370) ; 


TGGCAGAACT 


(SEQ 


ID 


NO 


:371) ; 






TT) 

XU 




• 172 ^ * 


GCAGAACTAC 


(SEQ 


ID 


NO 


:373) ; 


CAGAACTACA 


(bbQ 


XL) 


"MP 

IN\J 


! J / ^i J / 


AHAAPTACAG 


(SEO 


ID 


NO 


:375) ; 


GAACTACACA 


/ P?T?/"\ 

(SEQ 


XU 


vtpa 
XnU 


: j / 0 } f 


A APT A PAP AP 


(SEO 


ID 


NO 


-377) • 


ACTACACACC 


(SEQ 


ID 


NO 


: 378 ) ; 


/*""T>A OA ^~*A f~*r~* A 


( 


X U 


IN W 


• 3 f y } , 


TACACACCAG 


(SEQ 


ID 


NO 


:380) ; 


ACACACCAGG 


(SEQ 


ID 


NO 


:381) ; 


CACACCAGGG 


(SEQ 


ID 


NO 


:382) ; 


AC AC CAGGG C 


- (SEQ 


ID 


NO 


:383) ; 


CACCAGGGCC 


(SEQ 


ID 


NO 


:384) ; 


ACCAGGGCCA 


(SEQ 


ID 


NO 


:385) ; 


CCAGGGCCAG 


(SEQ 


ID 


NO 


:386) ; 


CAGGGCCAGG 


(SEQ 


ID 


NO 


:387) ; 


AGGGCCAGGG 


(SEQ 


ID 


NO 


:388) ; 


GGGCCAGGGG 


(SEQ 


ID 


NO 


:389) ; 
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GGCCAGGGGT (SEQ ID NO: 3 90); 
CCAGGGGTCA (SEQ ID NO : 3 9 2 ) ; 
AGGGGTCAGA (SEQ ID NO: 3 94); 
GGGTCAGATA (SEQ ID NO : 3 9 6 ) ; 
GTCAGATATC (SEQ ID NO: 3 98); 
CAGATATCCA (SEQ ID NO:400); 
GATATCCACT (SEQ ID NO: 4 02); 
TATCCACTGA (SEQ ID NO:404); 
TCCACTGACC (SEQ ID NO:406); 
CACTGACCTT (SEQ ID NO: 408); 
CTGACCTTTG (SEQ ID NO: 4 10); 
GACCTTTGGA (SEQ ID NO.-412); 
CCTTTGGATG (SEQ ID NO: 4 14); 
TTTGGATGGT (SEQ ID NO: 4 16) ; 
TGGATGGTGC (SEQ ID NO : 4 1 8 ) ; 
GATGGTGCTA (SEQ ID NO: 420); 
TGGTGCTACA ( SEQ ID NO : 4 2 2 ) ; 
GTGCTACAAG (SEQ ID NO: 424); 
GCTACAAGCT (SEQ ID NO:426); 
TACAAGCTAG ( SEQ ID NO : 4 2 8 ) ; 
CAAGCTAGTA ( SEQ ID NO : 4 3 0 ) ; 
AGCTAGTACC (SEQ ID NO: 432) ; 
CTAGTACCAG (SEQ ID NO: 434) ; 
AGTAC CAGTT (SEQ ID NO: 43 6) ; 
TACCAGTTGA ( SEQ ID NO : 4 3 8 ) ; 
CCAGTTGAGC ( SEQ ID NO : 4 4 0 ) ; 
AGTTGAGCCA ( SEQ ID NO : 4 4 2 ) ; 
TTGAG CCAGA (SEQ ID NO : 444 ) ; 
GAGCCAGATA ( SEQ ID NO : 4 4 6 ) ; 
GCCAGATAAG ( SEQ ID NO : 4 4 8 ) ; 
CAGATAAGGT (SEQ ID NO: 45 0) ; 
GATAAGGTAG (SEQ ID NO: 452); 
TAAGGTAGAA (SEQ ID NO: 4 54); 
AGGTAGAAGA (SEQ ID NO: 456) ; 
GTAGAAGAGG ( SEQ ID NO : 4 5 8 ) ; 
AGAAGAGGCC (SEQ ID NO: 460); 
AAGAGGCCAA (SEQ ID NO: 462); 
GAGGCCAATA (SEQ ID NO: 4 64); 



206 - 

GCCAGGGGTC (SEQ ID NO:391); 
CAGGGGTCAG ( SEQ ID NO : 3 93 ) ; 
GGGGTCAGAT ( SEQ ID NO : 3 95 ) ; 

GGTCAGATAT ( SEQ ID NO : 3 9 7 ) ; 

TCAGATATCC (SEQ ID NO : 3 99 ) ; 

AGATATCCAC (SEQ ID NO: 4 01) ; 

ATATCCACTG (SEQ ID NO:403); 

ATCCACTGAC (SEQ ID NO: 4 05) ; 

CCACTGACCT (SEQ ID NO: 4 07) ; 

ACTGACCTTT (SEQ ID NO: 409) ; 

TGACCTTTGG (SEQ ID N0:411); 

ACCTTTGGAT (SEQ ID NO:413); 

CTTTGGATGG (SEQ ID NO:415); 

TTGGATGGTG (SEQ ID NO: 417) ; 

GGATGGTGCT (SEQ ID NO: 4 19) ; 

ATGGTGCTAC (SEQ ID NO: 421) ; 

GGTGCTACAA (SEQ ID NO:423); 

TGCTACAAGC (SEQ ID NO:425); 

CTACAAGCTA (SEQ ID NO: 42 7) ; 

ACAAGCTAGT (SEQ ID NO: 42 9) ; 

AAGCTAGTAC ( SEQ ID NO : 4 3 1 ) ; 

GCTAGTACGA (SEQ ID NO:433); 

TAGTACCAGT ( SEQ ID NO : 4 3 5 ) ; 

GTACCAGTTG (SEQ ID NO: 43 7) ; 
ACCAGTTGAG (SEQ ID NO: 43 9) ; 

CAGTTGAGCC (SEQ ID NO: 441) ; 
GTTGAGCCAG (SEQ ID NO:443); 
TGAGCCAGAT (SEQ ID NO:445); 
AGCCAGATAA (SEQ ID NO: 44 7) ; 
CCAGATAAGG (SEQ ID NO: 44 9) ; 
AGATAAGGTA (SEQ ID NO:451); 
ATAAGGTAGA (SEQ ID NO: 4 53) ; 
AAGGTAGAAG (SEQ ID NO: 4 55) ; 
GGTAGAAGAG ( SEQ ID NO : 4 5 7 ) ; 
TAGAAGAGGC (SEQ ID NO:459); 
GAAGAGGCCA (SEQ ID NO: 461) ; 
AGAGGCCAAT (SEQ ID NO:463); 
AGG CCAATAA (SEQ ID NO: 465) ; 
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GGCCAATAAA 


(SEQ 


ID 


NO : 


466) , 


CC AATAAAGG 


(SEQ 


ID 


NO : 


468) ^ 


AATAAAGGAG 


(SEQ 


ID 


NO : 


47 0) j 


TAAAGGAGAG 


(SEQ 


ID 


NO : 


4 7 2 ) 


AAGGAGAGAA 


(SEQ 


ID 


XT A 

NO : 


A 1 A \ 

4 /4 ) 


GGAGAGAACA 


( SEQ 


ID 


NO : 


4 / b / 


AGAGAACACC 


(SEQ 


TTfc 
LU 


INU : 


*± / O ) 






X U 


Krn • 

ri\J » 


a r n ) 

*± O \J J 


AALALUftbL 1 




XJJ 


wn • 




CACCAGCTTG 


/ O T? A 

(SEQ 


T T\ 

lu 


\Tr\ . 

NU : 


A Q A \ 
4 o 4 ) 


CCAGCTTGTT 


/ PDA 

(SEQ 


ID 


XT A . 

NO : 


4 o b j 


AGCTTGTTAC 


(SEQ 


ID 


NO: 


488) 


CTTGTTACAC 


(SEQ 


ID 


NO: 


490) 


TGTTACACCC 


(SEQ 


ID 


NO: 


492) 


TTACACCCTG 


(SEQ 


ID 


NO: 


494 ) 


ACACCCTGTG 


(SEQ 


ID 


NO : 


496 ) 


ACCCTGTGAG 


(SEQ 


ID 


NO : 


498 ) 


CCTGTGAGCC 


(SEQ 


ID 


XT A 

NO : 


50 0) 


TGTGAGCCTG 


(SEQ 


ID 


NO : 


502) 


TGAGCCTGCA 


(SEQ 


ID 


NO 


504) 


AGCCTGCATG 


(SEQ 


ID 


NO 


•506) 


CCTGCATGGA 


(SEQ 


ID 


NO 


:508) 


TGCATGGAAT 


(SEQ 


ID 


NO 


:510) 


CATGGAATGG 


(SEQ 


ID 


NO 


:512) 


TGGAATGGAT 


(SEQ 


ID 


NO 


: 514) 


GAATGGATGA 


(SEQ 


ID 


NO 


: 516 ) 


ATGGATGACC 


(SEQ 


ID 


NO 


: 518 ) 


GGATGACCCT 


(SEQ 


ID 


NO 


: 520 ) 


ATGACCCTGA 


(SEQ 


ID 


NO 


: 522) 


GACCCTGAGA 


(SEQ 


ID 


NO 


: 524) 


CCCTGAGAGA 


(SEQ 


ID 


NO 


: 526) 


CTGAGAGAGA 


(SEQ 


ID 


NO 


: 528 ) 


GAGAGAGAAG 


(SEQ 


ID 


NO 


:530) 


GAGAGAAGTG 


(SEQ 


ID 


NO 


:532) 


GAGAAGTGTT 


(SEQ 


ID 


NO 


:534) 


GAAGTGTTAG 


(SEQ 


ID 


NO 


:536) 


AGTGTTAGAG 


(SEQ 


ID 


NO 


:538) 


TGTTAGAGTG 


(SEQ 


ID 


NO 


:540) 
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GCCAATAAAG (SEQ ID NO: 4 67); 
CAATAAAGGA (SEQ ID NO:469); 
ATAAAGGAGA (SEQ ID NO: 471) ; 
AAAGGAGAGA (SEQ ID NO: 473) ; 
AGGAGAGAAC (SEQ ID NO: 475) ; 
GAGAGAACAC (SEQ ID NO: 477) ; 
GAGAACACCA (SEQ ID NO: 47 9) ; 
GAACACCAGC (SEQ ID NO: 481); 
ACACCAGCTT (SEQ ID NO: 483) ; 
ACCAGCTTGT (SEQ ID NO: 485) ; 
CAGCTTGTTA (SEQ ID NO: 487) ; 
GCTTGTTACA (SEQ ID NO: 48 9) ; 
TTGTTACACC (SEQ ID NO: 491) ; 
GTTACACCCT (SEQ ID NO: 493) ; 
TACACCCTGT (SEQ ID NO: 4 95) ; 
CACCCTGTGA (SEQ ID NO: 4 97) ; 
CCCTGTGAGC (SEQ ID NO: 499) ; 
CTGTGAGCCT ( SEQ ID NO : 5 0 1 ) ; 
GTGAGCCTGC (SEQ ID NO: 5 03); 
GAGCCTGCAT (SEQ ID NO: 5 05); 
GCCTGCATGG (SEQ ID NO: 507) ; 
CTGCATGGAA (SEQ ID NO: 50 9) ; 
GCATGGAATG (SEQ ID NO: 511) ; 
ATGGAATGGA (SEQ ID NO: 513) ; 
GGAATGGATG (SEQ ID NO: 515) ; 
AATGGATGAC (SEQ ID NO : 517) ; 
TGGATGACCC (SEQ ID NO: 519); 
GATGACCCTG (SEQ ID NO: 521) ; 
TGACCCTGAG (SEQ ID NO: 523); 
ACCCTGAGAG (SEQ ID NO: 525); 
CCTGAGAGAG (SEQ ID NO: 527); 
TGAGAGAGAA (SEQ ID NO: 529); 
AGAGAGAAGT (SEQ ID NO: 531) ; 
AGAGAAGTGT (SEQ ID NO : 5 3 3 ) ; 
AGAAGTGTTA (SEQ ID NO: 53 5) ; 
AAGTGTTAGA (SEQ ID NO: 537); 
GTGTTAGAGT (SEQ ID NO: 539) ; 
GTTAGAGTGG (SEQ ID NO: 541); 
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TTAGAGTGGA (SEQ ID NO: 542) 
AGAGTGGAGG (SEQ ID NO: 544) 
AGTGGAGGTT (SEQ ID NO: 54 6) 
TGGAGGTTTG (SEQ ID NO: 548) 
GAGGTTTGAC ( SEQ ID NO : 5 5 0 ) 
GGTTTGACAG (SEQ ID NO: 5 52) 
TTTGACAGCC (SEQ ID NO: 554) 
TGACAGCCGC (SEQ ID NO: 556) 
ACAGCCGCCT (SEQ ID NO: 558) 
AGCCGCCTAG (SEQ ID NO: 560) 
CCGCCTAGCA (SEQ ID NO: 562) 
GCCTAGCATT (SEQ ID NO: 564) 
CTAGCATTTC (SEQ ID NO: 5 66) 
AGCATTTCAT (SEQ ID NO: 568) 
CATTTCATCA (SEQ ID NO: 570) 
TTTCATCACG (SEQ ID NO: 572) 
TCATCACGTG (SEQ ID NO: 574) 
ATCACGTGGC (SEQ ID NO: 576) 
CACGTGGCCC (SEQ ID NO: 5 78) 
CGTGGCCCGA (SEQ ID NO: 580) 
TGGCCCGAGA (SEQ ID NO: 582) 
GCGCGAGAGC (SEQ ID NO: 5 84) 
CCGAGAGCTG (SEQ ID NO: 586) 
GAGAGCTGCA (SEQ ID NO : 5 8 8 ) 
GAGCTGCATC (SEQ ID NO: 5 90) 
GCTGCATCCG (SEQ ID NO:592) 
TGCATCCGGA (SEQ ID NO:594), 
CAT C CGGAGT (SEQ ID NO: 5 96), 
TCCGGAGTAC (SEQ ID NO: 598), 
CGGAGTACTT ( SEQ ID NO : 6 0 0 ) , 
GAGTACTTCA (SEQ ID NO : 602 )' ; 
GTACTTCAAG (SEQ ID NO: 6 04); 
ACTTCAAGAA (SEQ ID NO: 606); 
TTCAAGAACT (SEQ ID NO: 608); 
CAAGAACTGC (SEQ ID NO:610); 
AGAACTGCTG (SEQ ID NO: 612); 
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TAGAGTGGAG (SEQ ID NO: 543) 
GAGTGGAGGT (SEQ ID NO: 54 5) 
GTGGAGGTTT (SEQ ID NO: 547) 
GGAGGTTTGA (SEQ ID NO: 54 9) 
AGGTTTGACA (SEQ ID NO: 551) 
GTTTGACAGC (SEQ ID NO: 553) 
TTGACAGCCG (SEQ ID NO: 555) 
GACAGCCGCC (SEQ ID NO: 55 7) 
CAGCCGCCTA (SEQ ID NO: 559) 
GCCGCCTAGC (SEQ ID NO:561) 
CGCCTAGCAT (SEQ ID NO: 563), 
CCTAGCATTT (SEQ ID NO:565), 
TAGCATTTCA (SEQ ID NO: 567) , 
GCATTTCATC (SEQ ID NO:569); 

ATTTCATCAC (SEQ ID NO: 571); 
TTCATCACGT (SEQ ID NO:573); 
CATCACGTGG (SEQ ID NO: 575); 

TCACGTGGCC (SEQ ID NO: 577) ; 

ACGTGGCCCG (SEQ ID NO: 579) ; 

GTGGCCCGAG (SEQ ID NO: 581); 

GGCCCGAGAG (SEQ ID NO: 583); 

CCCGAGAGCT (SEQ ID NO: 585); 

CGAGAGCTGC (SEQ ID NO: 5 87); 

AGAGCTGCAT (SEQ ID NO: 589); 

AGCTGCATCC (SEQ ID NO: 5 91); 

CTGCATCCGG (SEQ ID NO: 5 93); 

GCATCCGGAG (SEQ ID NO: 5 95); 

ATCCGGAGTA ( SEQ ID NO : 5 9 7 ) ; 

CCGGAGTACT (SEQ ID NO: 599); 

GGAGTACTTC (SEQ ID NO: 6 01); 

AGTACTTCAA (SEQ ID NO: 603); 

TACTTCAAGA (SEQ ID NO: 6 05); 

CTTCAAGAAC (SEQ ID NO: 6 07); 

TCAAGAACTG (SEQ ID NO:609); 

AAGAACTGCT (SEQ ID NO: 6 11); 

GAACTGCTGA (SEQ ID NO: 613) . 
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33. A method according to claim 27 wherein said deletion encompasses at least one 



oi ine ioiiowing 


decanucleotides from the LTR region of HIV- 1 strain NL4-3: 


GCTTTTTGCC 


(SEQ 


ID 


NO : 


6 52; ; 


LI J. 1 1 1 1 




ID 

X LJ 


NO : 


6 53) ; 


TTTTTGCCTG 


(SEQ 


ID 


NU : 


bb4 ) ; 


111 luLLivJl 


(SEO 


ID 


NO :* 


655) ; 




(SEQ 


ID 


NO : 


odd) ; 


1 1 LjLL, 1 \J 




TD 

X 1— ' 


NO: 


6 57) ; 


TGCCTGTACT 


(SEQ 


ID 


NO : 


6 5 8) ; 


(jL-L- 1 blAU 1 ^7 




TD 


NO : 


659) • 


CCTGTACTGG 


(SEQ 


ID 


NO : 


66 0) ; 


i\j I AL 1 bbu 




TD 

X lv 


NO * 


661) * 


TGTACTGGGT 


(SEQ 


ID 


NO : 


6 62 } ; 


1 AL 1 1 L 




TD 


NO : 


663 ) - 


TACTGGGT CT 


(SEQ 


ID 


NO : 


6 64) ; 


AC- ICjjLjCj iLlt 




TD 
Xx^ 


NO • 


665) • 


CTGGGTCTCT 


(SEQ 


ID 


NO : 


6 66) ; 


t* t~* t* r 1 t* hp f* 




TD 
X u 


NO • 


667) • 


GGGTCTCTCT 


(SEQ 


ID 


NO : 


6 68) / 


/"i /"^l m /^t rrn /""i Ti ^ T* z" 1 

GGTCTC1 L ICj 


/ CT70 


Tn 


MO « 

INV^ - 


D D y } f 


GTCTCTCTGG 


(SEQ 


ID 


NO: 


670) ; 


TCTCTCTGGT 


(SEQ 


ID 


1NU : 


o / 1 ) ; 


CTCTCTGGTT 


(SEQ 


ID 


NO: 


672) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO : 


673) ; 


CTCTGGTTAG 


(SEQ 


ID 


NO: 


674) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO : 


o / b ; ; 


CTGGTTAGAC 


(SEQ 


ID 


NO: 


676) ; 


TGGTTAGACC 


(SEQ 


ID 


NO : 


b / / ) ; 


GGTTAGACCA 


(SEQ 


ID 


NO: 


678) ; 


GTTAGACCAG 


(SEQ 


ID 


NU : 




TTAGACCAGA 


(SEQ 


ID 


NO: 


680) ; 


TAGACCAGAT 


(SEQ 


ID 


JNU : 


b o JL ) , 


AGACCAGATC 


(SEQ 


ID 


NO: 


682) ; 


GACCAGATCT 


(SEQ 


ID 


NU : 




ACCAGATCTG 


(SEQ 


ID 


NO: 


684) ; 


CCAGATCTGA 


(SEQ 


ID 


NO : 


685) ; 


CAGATCTGAG 


(SEQ 


ID 


NO: 


686) ; 


AGATCTGAGC 


(SEQ 


ID 


NO: 


687) ; 


GATCTGAGCC 


(SEQ 


ID 


NO: 


688) / 


ATCTGAGCCT 


(SEQ 


ID 


NO : 


689) ; 


TCTGAGCCTG 


(SEQ 


ID 


NO: 


690) ; 


CTGAGCCTGG 


(SEQ 


ID 


NO : 


6 91) ; 


TGAGCCTGGG 


(SEQ 


ID 


NO' 


692) ; 


GAGCCTGGGA 


(SEQ 


ID 


NO : 


6 93) ; 


AGCCTGGGAG 


(SEQ 


ID 


NO 


:694) ; 


GCCTGGGAGC 


(SEQ 


ID 


NO 1 




CCTGGGAGCT 


(SEQ 


ID 


NO 


:696) ; 


CTGGGAGCTC 


(SEQ 


in 
ID 


INiU 


: b y / ) ; 


TGGGAGCTCT 


(SEQ 


ID 


NO 


: 698) ; 


GGGAGCTCTC 


(SEQ 


T T"\ 

ID 


NO 


: 6 9 9 ) ; 


GGAGCTCTCT 


(SEQ 


ID 


NO 


:700) ; 


GAGCTCTCTG 


(SCiQ 


XL) 




: /Ul) , 


AGCTCTCTGG 


(SEQ 


ID 


NO 


:702) ; 


GCTCTCTGGC 


(SEQ 


ID 


NU 


: vui ) ; 


CTCTCTGGCT 


(SEQ 


ID 


NO 


:704) ; 


TCTCTGGCTA 


(SEQ 


ID 


NO 


:705) ; 


CTCTGGCTAA 


(SEQ 


ID 


NO 


:706) ; 


TCTGGCTAAC 


(SEQ 


ID 


NO 


:707) ; 


CTGGCTAACT 


(SEQ 


ID 


NO 


:708) ; 


TGGCTAACTA 


(SEQ 


ID 


NO 


:709) ; 


GGCTAACTAG 


(SEQ 


ID 


NO 


:710) ; 


GCTAACTAGG 


(SEQ 


ID 


NO 


:711) ; 


CTAACTAGGG 


(SEQ 


ID 


NO 


:712) ; 


TAACTAGGGA 


(SEQ 


ID 


NO 


:713) ; 


AACTAGGGAA 


(SEQ 


ID 


NO 


:714) ; 


ACTAGGGAAC 


(SEQ 


ID 


NO 


:715) ; 


CTAGGGAACC 


(SEQ 


ID 


NO 


: 716) ; 


TAGGGAACCC 


(SEQ 


ID 


NO 


:717) ; 


AGGGAACCCA 


(SEQ 


ID 


NO 


:718) ; 


GGGAACCCAC 


(SEQ 


ID 


NO 


:719) ; 
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kjL7AAv_-*wUAL. 1 


(bfc,(J 


XU 


NO 


^ *"7 *"i r\ \ 

t 7 20 ) 






XU 


"NT Pi 


: 122) 






Tn 

XL) 


INC 


I / 2*± ) 




/ G "PP. 


T T\ 
XU 




: f 2 b ; 




/ CTTP 


T T\ 

, ID 


XTP 

NU 


: 728) 




(SEQ 


ID 


NO 


: 730) 


r P r P A A P PPTPA 


(SEQ 


ID 


NO 


: 732 ) 


AAoCC i CAAi 


/ O TT 1 /""A 

(SEQ 


ID 


NO 


: 734 ) 


uLl J. LAA1AA 


(SEQ 


ID 


NO 


: 736 ) 


pfrpA ATA A A P 

1 LAA 1 AAAG 


(SEQ 


ID 


NO 


: 738) 


r"» A A •"PA 7\ 7\ O T"t 

GAA I AAAGCT 


(SEQ 


ID 


NO 


: 740) 


A I AAAG CTTG 


(SEQ 


ID 


NO 


: 742) 


AAAGGTTGCC 


(SEQ 


ID 


NO 


: 744 ) 


AbL 1 1 GCC I T 


(SEQ 


ID 


NO 


: 746) 


CI J.GCCI 1GA 


(SEQ 


ID 


NO 


: 748 ) 


1 <jCC 1 1 GAGT 


(SEQ 


ID 


NO 


: 750) 


CC 1 1 GAG I GC 


(SEQ 


ID 


NO 


. 752) 


T"PP A P TP / r t r P f T ' 

I 1 GAG 1 GC 1 x 


(SEQ 


ID 


NO 


754) 


GAGTGCTTCA 


(SEQ 


ID 


NO 


756) 


G1GG1 rCAAG 


(SEQ 


ID 


NO 


758) 


GC 1 1 GAAG I A 


(SEQ 


ID 


NO : 


760) 


rTCAAGTAGT 


(SEQ 


ID 


NO : 


762) 


PA APT 1 A n r P/*"Ti 

GAAG I AG I GT 


(SEQ 


ID 


NO : 


764 ) 


AG 1 AG 1 G 1 GT 


(SEQ 


ID 


NO : 


766) 


1 AG lGTGTGC 


(SEQ 


ID 


NO : 


768) 


/~> rp r*\ rp/~i m/T j*t 

GTGTGTGCCC 


(SEQ 


ID 


NO : 


770) 


G rGTGCCCGT 


(SEQ 


ID 


NO: 


772) 


G1GCCCGTCT 


(SEQ 


ID 


NO : 


774) , 


GGGGG 1 G 1 GT. 


(SEQ 


ID 


NO : 


776) , 


PPPTPTPT'PP 
V-^wLj j. ^- 1 I i C 


(SEQ 


ID 


NO : 


778) J 


T 1 P TTZ TTP T 1 P 


(bhiQ 


ID 


NO : 


780) , 


c ir vrz r v r m'vr"vr* 


/ OTP 


ID 


NO : 


782) t 


f3 r r r rriT , f2 r t , r* a r 1 

1 U ib 1 LjAC 


(SLQ 


T T"\ 

ID 


NO : 


784 ) , 


iulul Vji-iL. 1 C 


/ OTTP 

( ohi^ 


Tt\ 

ID 


NO : 


786) , 


I G 1 GTGACTC 


(SEQ 


ID 


NO : 


788) , 


TGTGACTCTG 


(SEQ 


ID 


NO: 


790) , 


TGACTCTGGT 


(SEQ 


ID 


NO: 


792) ; 


ACTCTGGTAA 


(SEQ 


ID 


NO: 


794) ; 



0 - 



GAAC C CACTG 


(SEQ 


ID 


NO 


: 721 ) 


A PPP A PTPfT 1 

ALLLAL i GG i 


(SEQ 


ID 


NO 


: 723 ) 




(bhU 


ID 


"MP 

NU 


: /2b) 


A PTP P r P r T 1 A A P 

AGiGGl 1AAG 


(SEQ 


T T"\ 

ID 


NO 


: 727) 


TGCTTAAGGC 


(SEQ 


ID 


NO 


: 729 ) 


CTTAAGCCTC 


(SEQ 


ID 


NO 


: 731) 


TAAGCCTCAA 


(SEQ 


ID 


NO 


: 733 ) 


AGCCTCAATA 


(SEQ 


ID 


NO 


: 735) 


CCTCAATAAA 


(SEQ 


ID 


NO 


: 737) 


TCAATAAAGC 


(SEQ 


ID 


NO 


: 739) 


AATAAAGCTT 


(SEQ 


ID 


NO 


: 741) 


TAAAGCTTGC 


(SEQ 


ID 


NO 


: 743) 


AAGCTTGCCT 


(SEQ 


ID 


NO 


: 745) 


GCTTGCCTTG 


(SEQ 


ID 


NO 


: 747) 


TTGCCTTGAG 


(SEQ 


ID 


NO 


. 749) 


GCCTTGAGTG 


(SEQ 


ID 


NO 


751) 


CTTGAGTGCT 


(SEQ 


ID 


NO 


753 ) 


TGAGTGCTTC 


(SEQ 


ID 


NO : 


755) 


AGTGCTTCAA 


(SEQ 


ID 


NO: 


757) 


TGCTTCAAGT 


(SEQ 


ID 


NO: 


759) 


CTTCAAGTAG 


(SEQ 


ID 


NO: 


761) 


TCAAGTAGTG 


(SEQ 


ID 


NO: 


763) 


AAGTAGTGTG 


(SEQ 


ID 


NO: 


765) 


GTAGTGTGTG 


(SEQ 


ID 


NO: 


767) 


AGTGTGTGCC 


(SEQ 


ID 


NO : 


769) 


TGTGTGCCCG 


(SEQ 


ID 


NO: 


771) 


TGTGCCCGTC 


(SEQ 


ID 


NO: 


773) 


TGCCCGTCTG 


(SEQ 


ID 


NO: 


775) 


CCCGTCTGTT 


(SEQ 


ID 


NO: 


111) 


CGTCTGTTGT 


(SEQ 


ID 


NO: 


779) 


1 CTGTTGTGT 


(SEQ 


ID 


NO : 


781) 


TGTTGTGTGA 


(SEQ 


ID 


NO : 


783 ) 


rpmprp/impT. prp 

1 1G1G1GACT 


(SEQ 


ID 


NO : 


785) 


PT'P'T^P A OrT»r^rp 

G 1 G I GAG 1 CI 


(SEQ 


ID 


NO : 


787) 


GTGTGACTCT 


(SEQ 


ID 


NO: 


789) 


GTGACTCTGG 


(SEQ 


ID 


NO: 


791) 


GACTCTGGTA 


(SEQ 


ID 


NO: 


793) 


CTCTGGTAAC 


(SEQ 


ID 


NO: 


795) 
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TCTGGTAACT (SEQ ID NO:796); CTGGTAACTA (SEQ ID NO:797); 
TGGTAACTAG (SEQ ID NO: 798) ; GGTAACTAGA (SEQ ID NO: 799) . 

34. A method according to claim 24 wherein said HIV-1 isolate has the identifying 
characteristics of HIV-1 isolate ECACC V94101706 or HIV-1 isolate ECACC 
V94 1031 169 or isolate C54 deposited at ECACC on 14 February, 1995 under Accession 
Number . 

35. A method according to claim 20 wherein the HIV-1 isolate is reactive to 
antibodies to a glycoprotein of HIV-1, is capable of inducing an immune response of 
gag, pol and/or env and which is incapable of directing synthesis of a we/ gene product 
or a full length nef gene product. 

36. A method according to claim 35 wherein the glycoprotein is at least one of gp41- 
45, gpl20 and/or gpl60. 

37. A method for vaccinating an individual against the development of AIDS or 
AIDS related diseases, said method comprising administering to said individual a non- 
pathogenic isolate of HIV-1 in an amount effective to infect target cells and to generate 
target cells carrying DNA derived from said non-pathogenic HIV-1. 

38. A method according to claim 37 wherein said isolate is capable of stimulating 
an immune response in a human or primate against at least one glycoprotein on HIV-1 
whilst not substantially reducing proliferative responses and cytokine production to a 
mitogen in said human or primate compared to a healthy, non-infected human or primate 
subject. 

39. A method according to claim 38 wherein said isolate carries one or more 
mutations in its genome resulting in the inability to direct synthesis of at least one 
polypeptide or protein from a pathogenic strain of HIV-1. 
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40. A method according to claim 17 wherein said isolate carried one or more 
mutations in its genome resulting in said genome directing synthesis of a truncated form 
of a polypeptide or protein from a pathogenic strain of HIV- 1 . 

41. A method according to claim 39 or 40 wherein said HIV-1 isolate carries a 
mutation in its nef gene and/or long terminal repeat (LTR) region or in a functionally 
equivalent location in the HIV-1 genome. 

42. A method according to claim 40 wherein said HIV-1 isolate is reactive to 
antibodies to gp41-45, gpl20 and/or gpl60 of HIV-1 and carries a deletion of at least 
ten contiguous nucleotides in a region corresponding to the nef gene and/or LTR region 
of pathogenic HIV-1. 

43. A method according to claim 42 wherein said HIV-1 isolate is capable of 
inducing an immune response to at least one of gag, pol and/or env. 

44. A method according to claim 43 wherein said HIV-1 isolate carries a deletion in 
its genome of at least 10 nucleotides within the region from nucleotide 8787 to 
nucleotide 9709 using the nucleotide numbering of HIV-1 strain NL4-3. 

45. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (i) 8830-8862; 



46. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 



(ii) 

(iii) 

(iv) 



9033-9049. 



9019-9029 



9009-9035 



and 



nucleotide 



(v) 
(vi) 



9281-9371; 
9281-9362; 
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(vii) 9105-9224; and 

(viii) ; 9271-9370. 

47. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 

47. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 



48. A method according to claim 44 wherein said deletion encompasses at least one 
of the following decanucleotides from the nef gene of HIV-1 strain NL43: 



ATGGGTGGCA 


(SEQ 


ID 


NO: 


2) ; 


TGGGTGGCAA 


(SEQ ID 


NO: 


3) ; 




GGGTGGCAAG 


(SEQ 


ID 


NO: 


4) ; 


GGTGGCAAGT 


(SEQ ID 


NO: 


5) ; 




GTGGCAAGTG 


(SEQ 


ID 


NO: 


6) ; 


TGGCAAGTGG 


(SEQ ID 


NO: 


7) ; 




GGCAAGTGGT 


(SEQ 


ID 


NO: 


8) ; 


GCAAGTGGTC 


(SEQ ID 


NO: 


9) ; 




CAAGTGGTCA 


(SEQ 


ID 


NO: 


10) ; 


AAGTGGTCAA 


(SEQ 


ID 


NO: 


ID ; 


AGTGGTCAAA 


(SEQ 


ID 


NO: 


12) ; 


GTGGTCAAAA 


(SEQ 


ID 


NO: 


13) ; 


TGGTCAAAAA 


(SEQ 


ID 


NO: 


14) ; 


GGTCAAAAAG 


(SEQ 


ID 


NO: 


15) ; 


GTCAAAAAGT 


(SEQ 


ID 


NO: 


16) ; 


TCAAAAAGTA 


(SEQ 


ID 


NO: 


17) ; 


CAAAAAGTAG 


(SEQ 


ID 


NO: 


18) ; 


AAAAAGTAGT 


(SEQ ID 


NO 


19) 




AAAAGTAGTG 


(SEQ 


ID 


NO: 


20) ; 


AAAGTAGTGT 


(SEQ 


ID 


NO: 


21) ; 


AAGTAGTGTG 


(SEQ 


ID 


NO: 


22) ; 


AGTAGTGTGA 


(SEQ 


ID 


NO: 


23) ; 


GTAGTGTGAT 


(SEQ 


ID 


NO: 


24) ; 


TAGTGTGATT 


(SEQ ID 


i NO 


:25) 




AGTGTGATTG 


(SEQ 


ID 


NO: 


26) ; 


GTGTGATTGG 


(SEQ 


ID 


NO: 


27) ; 


TGTGATTGGA 


(SEQ 


ID 


NO: 


28) ; 


GTGATTGGAT 


(SEQ 


ID 


NO' 


29) ; 


TGATTGGATG 


(SEQ 


ID 


NO: 


30) ; 


GATTGGATGG 


(SEQ 


ID 


NO 


:31) ; 


ATTGGATGGC 


(SEQ 


ID 


NO: 


32) ; 


TTGGATGGCC 


(SEQ 


ID 


NO 


= 33) ; 


TGGATGGCCT 


(SEQ 


ID 


NO 


34) ; 


GGATGGCCTG 


(SEQ 


ID 


NO 


:35) ; 
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GATGGCCTGC (SEQ ID NO: 36); 
TGGCCTGCTG (SEQ ID NO: 38); 
GCCTGCTGTA (SEQ ID NO: 40); 
CTGCTGTAAG (SEQ ID NO:42); 

GCTGTAAGGG (SEQ ID NO: 44); 

TGTAAGGGAA (SEQ ID NO: 46); 

TAAGGGAAAG (SEQ ID NO: 48) ; 

AGGGAAAGAA (SEQ ID NO: 50); 

GGAAAGAATG ( SEQ ID NO : 5 2 ) ; 

AAAGAATGAG (SEQ ID NO: 54); 

AGAATGAGAC (SEQ ID NO: 56); 

AATGAGACGA (SEQ ID NO: 58); 

TGAGACGAGC (SEQ ID NO: 60) ; 

AGACGAGCTG (SEQ ID NO: 62); 

ACGAGCTGAG (SEQ ID NO: 64); 

GAGCTGAGCC (SEQ ID NO : 66) ; 

GCTGAGCCAG (SEQ ID NO: 68); 

TGAGC CAGCA (SEQ ID NO: 70); 

AGCCAGCAGC (SEQ ID NO: 72) ; 

CCAGCAGCAG (SEQ ID NO: 74); 

AGCAGCAGAT (SEQ ID NO: 76); 

CAGCAGATGG (SEQ ID NO: 78); 

GCAGATGGGG (SEQ ID NO: 8 0); 
AGATGGGGTG ( SEQ ID NO : 8 2 ) ; 
ATGGGGTGGG (SEQ ID NO: 84); 
GGGGTGGGAG (SEQ ID NO: 86); 
GGTGGGAGCA (SEQ ID NO: 88); 
TGGGAGCAGT (SEQ ID NO: 90); 
GGAGCAGTAT (SEQ ID NO: 92); 
AGCAGTATCT (SEQ ID NO: 94); 
CAGTATCTCG (SEQ ID NO: 96); 
GTATCTCGAG (SEQ ID NO : 98 ) ; 
ATCTCGAGAC (SEQ ID NO: 100); 
CTCGAGACCT (SEQ ID NO: 102); 
CGAGACCTAG (SEQ ID NO:104); 
AGACCTAGAA (SEQ ID NO: 106); 
ACCTAGAAAA (SEQ ID NO: 108); 
CTAGAAAAAC (SEQ ID NO: 110); 
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ATGGCCTGCT 


(SEQ 


ID 


NO 


:37) ; 


GGCCTGCTGT 


(SEQ 


ID 


NO 


: 3 9 ) ; 


CCTGCTGTAA 


(SEQ 


ID 


NO 


:41) ; 


TGCTGTAAGG 


(SEQ 


ID 


NO 


:43) ; 


CTGTAAGGGA 


(SEQ 


ID 


NO 


:45) ; 


GTAAGGGAAA 


(SEQ 


ID 


NO 


:47) ; 


AAGGGAAAGA 


(SEQ 


ID 


NO 


:49) ; 


GGGAAAGAAT 


(SEQ 


ID 


NO 


:51) / 


GAAAGAATGA 


(SEQ 


ID 


NO 


:53) ; 


AAGAATGAGA 


(SEQ 


ID 


NO 


:55) ; 


GAATGAGACG 


(SEQ 


ID 


NO 


:57) ; 


ATGAGACGAG 


(SEQ 


ID 


NO 


:59) ; 


GAGACGAGCT 


(SEQ 


ID 


NO 


:61) ; 


GACGAGCTGA 


(SEQ 


ID 


NO 


i 63) ; 


CGAGCTGAGC 


(SEQ 


ID 


NO 


65) ; 


AGCTGAGCCA 


(SEQ 


ID 


NO 


67) ; 


CTGAGCCAGC 


(SEQ 


ID 


NO- 


69) ; 


GAGCCAGCAG 


(SEQ 


ID 


NO: 


71) ; 


GCCAGCAGCA 


(SEQ 


ID 


NO: 


73) ; 


CAG CAGCAG A 


(SEQ 


ID 


NO: 


75) ; 


GGAGCAGATG 


(SEQ 


ID 


NO: 


77) ; 


AGCAGATGGG 


(SEQ 


ID 


NO: 


79) ; 


CAGATGGGGT 


(SEQ 


ID 


NO: 


81) ; 


GATGGGGTGG 


(SEQ 


ID 


NO: 


83) ; 


TGGGGTGGGA 


(SEQ 


ID 


NO: 


85) ; 


GGGTGGGAGC 


(SEQ 


ID 


NO: 


87) ; 


GTGGG AG CAG 


(SEQ 


ID 


NO: 


89) ; 


GGGAGCAGTA 


(SEQ 


ID 


NO: 


91) ; 


GAGCAGTATC 


(SEQ 


ID 


NO: 


93) ; 


GCAGTATCTC 


(SEQ 


ID 


NO: 


95) ; 


AGTATCTCGA 


(SEQ 


ID 


NO: 


97) ; 


TATCTCGAGA 


(SEQ 


ID 


NO: 


99) ; 


TCTCGAGACC 


(SEQ 


ID 


NO: 101) ; 



TCGAGACCTA (SEQ ID NO: 103) 

GAGACCTAGA (SEQ ID NO: 105) 

GACCTAGAAA (SEQ ID NO: 107) 

CCTAGAAAAA (SEQ ID NO: 109) 

TAGAAAAACA (SEQ ID NO: 111) 
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AGAAAAACAT (SEQ ID NO: 112) 
AAAAACATGG (SEQ ID NO: 114) 
AAACATGGAG (SEQ ID NO: 116) 
ACATGGAGCA (SEQ ID NO: 118) 
ATGGAGCAAT (SEQ ID NO: 120) 
GGAGCAATCA (SEQ ID NO : 122 ) 
AGCAATCACA (SEQ ID NO: 124) 
CAATCACAAG (SEQ ID NO: 126) 
ATCACAAGTA (SEQ ID NO: 12 8) 
CACAAGTAGC (SEQ ID NO: 13 0) 
CAAGTAGCAA (SEQ ID NO: 13 2) 
AGTAGCAATA (SEQ ID NO: 134) 
TAGCAATACA (SEQ ID NO: 136) 
GCAATACAGC (SEQ ID NO: 13 8) 
AATACAGCAG (SEQ ID NO: 140) 
TACAGCAGCT (SEQ ID NO: 142) 
CAGCAGCTAA (SEQ ID NO: 144) 
GCAGCTAACA (SEQ ID NO: 14 6) 
AGCTAACAAT (SEQ ID NO: 14 8) 
CTAACAATGC (SEQ ID NO: 150) 
AACAATGCTG (SEQ ID NO: 152) 
CAATGCTGCT (SEQ ID NO : 1 5 4 ) 
ATGCTGCTTG (SEQ ID NO: 156) 
GCTGCTTGTG (SEQ ID NO: 158) 
TGCTTGTGCC (SEQ ID NO: 160) 
CTTGTGCCTG (SEQ ID NO: 162) 
TGTGCCTGGC (SEQ ID NO: 164) 
TGCCTGGCTA (SEQ ID NO: 166) 
CCTGGCTAGA (SEQ ID NO: 16 8) 
TGGCTAGAAG (SEQ ID NO: 17 0) 
GCTAGAAGCA (SEQ ID NO: 172) 
TAGAAGCACA (SEQ ID NO: 174) 
GAAGCACAAG (SEQ ID NO: 176) 
AGCACAAGAG (SEQ ID NO: 178) 
CACAAGAGGA (SEQ ID NO: 18 0) 
CAAGAGGAGG (SEQ ID NO: 182) 
AGAGGAGGAA (SEQ ID NO: 184) 
AGGAGGAAGA (SEQ ID NO: 186) 



215 - 



GAAAAACATG 


(SEQ 


ID 


NO: 


113) , 


AAAACATGGA 


(SEQ 


ID 


NO: 


115) , 


AACATGGAGC 


(SEQ 


ID 


NO : 


117) , 


CATGGAGCAA 


(SEQ 


ID 


NO: 


119) , 


TGGAGCAATC 


(SEQ 


ID 


NO: 


121) , 


GAGCAATCAC 


(SEQ 


ID 


NO: 


123) , 


GCAATCACAA 


(SEQ 


ID 


NO: 


125) , 


AATCACAAGT 


(SEQ 


ID 


NO: 


127) 


TCACAAGTAG 


(SEQ 


ID 


NO: 


129) 


ACAAGTAGCA 


(SEQ 


ID 


NO: 


131) 


AAGTAGCAAT 


(SEQ 


ID 


NO: 


133) 


GTAGCAATAC 


(SEQ 


ID 


NO: 


135) 


AnPAATArAG 


(SEO 


ID 


NO : 


137) 


r k i\i\ r ri\ni\rzr'i\ 




TD 

-L XJ 


NO : 


139) 


r\ x nuriuurtVJv 


( SEO 


ID 


NO : 


141) 


A C AG C AG C T A 


(SEO 


ID 


NO : 


143 ) 


AGCAGCTAAC 


(SEQ 


ID 


NO: 


145) 


CAGCTAACAA 


(SEQ 


ID 


NO: 


147) 


GCTAACAATG 


(SEQ 


ID 


NO: 


149) 


TAACAATGCT 


(SEQ 


ID 


NO: 


151) 


ACAATGCTGC 


(SEO 

\ ^J xjsc 


ID 


NO: 


153) 


AATGCTGCTT 


(SEO 


ID 


NO : 


155) 


X v_7 V X VJV.*. X X V_J X 


(SEO 


ID 


NO : 


157) 


CTGCTTGTGC 


(SEO 


ID 


NO: 


159) 


nCTTGTGCCT 

X XwXwV—WX 


(SEO 


ID 


NO : 


161) 


X iUlOuLiuu 




ID 

XX/ 


NO : 


163 ) 


n r vrzr , r tr vn.rzr tr v 

\j x uLL x x 




TD 

X XJ 


NO : 


165 ) 






TD 

J. XJ 


NO • 

Xri \-/ 


1 67 ) 

X vJ / / 






TD 

X XJ 


NO 


16 9} 




f SEO 


ID 


NO 


: 171) 


CTAGAAGCAC 


(SEQ 


ID 


NO 


: 173) 


AGAAGCACAA 


(SEQ 


ID 


NO 


:175) 






TD 

X XJ 




* 1 77 ) 


GCACAAGAGG 


(SEQ 


ID 


NO 


:179) 


ACAAGAGGAG 


(SEQ 


ID 


NO 


:181) 


AAGAGGAGGA 


(SEQ 


ID 


NO 


:183) 


GAGGAGGAAG 


(SEQ 


ID 


NO 


:185) 


GGAGGAAGAG 


(SEQ 


ID 


NO 


:187) 
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GAGGAAGAGG (SEQ ID NO: 18 8) 
GGAAGAGGTG (SEQ ID NO: 190) 
AAGAGGTGGG (SEQ ID NO: 192) 
GAGGTGGGTT (SEQ ID NO: 194) 
GGTGGGTTTT (SEQ ID NO: 196) 
TGGGTTTTCC (SEQ ID NO: 198) 
GGTTTTCCAG (SEQ ID NO: 2 00) 
TTTTCCAGTC (SEQ ID NO: 2 02) 
TTCCAGTCAC (SEQ ID NO: 204) 
CCAGTCACAC (SEQ ID NO: 2 06) 
AGTCACACCT (SEQ ID NO: 2 08) 
TCACACCTCA (SEQ ID NO: 210) 
ACACCTCAGG (SEQ ID NO: 212) 
ACCTCAGGTA (SEQ ID NO: 2 14) 
CTCAGGTACC (SEQ ID NO: 216) 
CAGGTACCTT (SEQ ID NO: 218) 
GGTACCTTTA (SEQ ID NO: 220) 
TACCTTTAAG (SEQ ID NO: 222) 
CCTTTAAGAC (SEQ ID NO: 224) 
TTTAAGACCA (SEQ ID NO: 226) 
TAAGACCAAT (SEQ ID NO: 22 8) 
AGAC CAATGA (SEQ ID NO: 230), 
ACCAATGACT (SEQ ID 1*0:232), 
CAATGACTTA (SEQ ID NO: 234), 
ATGACTTACA (SEQ ID NO: 236); 
GACTTACAAG (SEQ ID NO : 2 3 8 ) ; 
CTTACAAGGC (SEQ ID NO: 24 0); 
TACAAGGCAG (SEQ ID NO : 2 4 2 ) ; 
CAAGGCAGCT (SEQ ID NO:244); 
AGGCAGCTGT ( SEQ ID NO : 2 4 6 ) ; 
GCAGCTGTAG (SEQ ID NO: 248); 
AGCTGTAGAT ( SEQ ID NO : 2 5 0 ) ; 
CTGTAGATCT (SEQ ID NO: 2 52); 
GTAGATCTTA (SEQ ID NO:254); 
AGATCTTAGC (SEQ ID NO: 2 56); 
ATCTTAGCCA (SEQ ID NO: 258); 
CTTAGCCACT (SEQ ID NO: 2 60); 
TAGCCACTTT (SEQ ID NO: 2 62); 



- 216 - 



AGGAAGAGGT 


(SEQ 


ID 


NO 


: 189) 


GAAGAGGTGG 


(SEQ 


ID 


NO 


: 191) 


AGAGGTGGGT 


(SEQ 


ID 


NO 


: 193) 


AGGTGGGTTT 


(SEQ 


ID 


NO 


: 195) 


GTGGGTTTTC 


(SEQ 


ID 


NO 


: 197) 


GGGTTTTCCA 


(SEQ 


ID 


NO 


: 199) 


GTTTTCCAGT 


(SEQ 


ID 


NO 


:201) 


TTTCCAGTCA 


(SEQ 


ID 


NO 


:203) 


TCCAGTCACA 


(SEQ 


ID 


NO 


: 205) 


CAGTCACACC 


(SEQ 


ID 


NO 


: 207) 


GTCACACCTC 


(SEQ 


ID 


NO 


: 209) 


CACACCTCAG 


(SEQ 


ID 


NO 


: 211) 


CACCTCAGGT 


(SEQ 


ID 


NO 


: 213) 


CCTCAGGTAC 


(SEQ 


ID 


NO 


:215) 


TCAGGTACCT 


(SEQ 


ID 


NO 


.217) 


AGGTACCTTT 


(SEQ 


ID 


NO 


219) 


GTACCTTTAA 


(SEQ 


ID 


NO 


221) 


ACCTTTAAGA 


(SEQ 


ID 


NO, 


223) 


CTTTAAGACC 


(SEQ 


ID 


NO: 


225) 


TTAAGACCAA 


(SEQ 


ID 


NO: 


227) 


AAGACCAATG 


(SEQ 


ID 


NO: 


229) 


GAC CAATGAC 


(SEQ 


ID 


NO: 


231) 


CCAATGACTT 


(SEQ 


ID 


NO : 


233) 


AATGACTTAC 


(SEQ 


ID 


NO : 


235) 


TGACTTACAA 


(SEQ 


ID 


NO: 


237) 


ACTTACAAGG 


(SEQ 


ID 


NO: 


239) 


TTACAAGGCA 


(SEQ 


ID 


NO: 


241) 


ACAAGGCAGC 


(SEQ 


ID 


NO : 


243) 


AAGGCAGCTG 


(SEQ 


ID 


NO: 


245) 


GGCAGCTGTA 


(SEQ 


ID 


NO: 


247) 


CAGCTGTAGA 


(SEQ 


ID 


NO : 


249) 


GCTGTAGATC 


(SEQ 


ID 


NO : 


251) 


TGTAGAT CTT 


(SEQ 


ID 


NO : 


253) 


1 AGAT CTTAG 


(SEQ 


ID 


NO ; 


255) 


GATCTTAGCC 


(SEQ 


ID 


NO: 


257) 


TCTTAGCGAC 


(SEQ 


ID 


NO: 


259) , 


TTAGCCACTT 


(SEQ 


ID 


NO: 


261) , 


AGCCACTTTT 


(SEQ 


ID 


NO: 


263) , 
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CZ C C A C TTTT T 

\J \ Vwii-^w- X X X X X 


( SEQ 


ID 


NO 


2 64 ) ; 


V^x-iV*. X X X X X jfXTt 




ID 

X LJ 


NO 

AX kJ 


266 } • 

tL* O O / / 


TTTTTTAAAA 
\_ x x x x x jviruTi 




TD 

X LJ 


NO 


\J \J ) / 


TTTTAAAAGA 




TD 

X Xv 


NO 


270 ) • 






TD 

-L LJ 


NO 




AAAAGAAAAG 


(SEQ 


ID 


NO 


2 74) ; 


AAGAAAAGGG 


(SEQ 


ID 


NO 


276) ; 


GAAAAGGGGG 


(SEQ 


ID 


NO- 


278) ; 


AAAGGGGGGA 


(SEQ 


ID 


NO- 


28 0) ; 


AGGGGGGACT 


(SEQ 


ID 


NO - 


282) ; 


GGGGGACTGG 


(SEO 


TD 


NO ■ 

X^i w > 


284) ; 






TD 




/OD J , 






TPi 
1U 


INU : 


Zoo ) , 
















TD 






AAnnnPTAAT 

nnuuu V_ X Aft X 




TD 




*+ -j*± / / 




V O x-i ^ 


TD 


NO « 


- 7 ° / / 




( qrn 


TD 

X. LJ 


NO < 


OQO \ - 


TAATTPAfTP 


V ODy 


TD 
x xj 


NO - 


j \j \j / f 




{ cyn 

\ Oxi^ 


TD 




t no ^ • 






If 




Tnyi \ . 

; 


7V /"■T'O^/^TV TV TV /*-• 


v SEQ 


ID 


NO : 


3 0 6; ; 


m /"l /I ^"1 TV TV TV TV TV 

TCCCAAAGAA 


(SEQ 


ID 


NO : 


3 08 J ; 


C CAAAG AAG A 


(SEQ 


ID 


NO: 


310) ; 


AAAGAAGACA 


(SEQ 


ID 


NO : 


312) ; 


TV /**"* TV TV T\ /-* TV TV /"» 

AGAAGACAAG 


(SEQ 


ID 


NO : 


314 ) ; 






TPi 
iJj 




jib j ; 


APA AfiZiTAT 




TD 


NO • 


O x, o / / 


CAAGATATCC 


( SEO 


ID 


NO : 


32 0) ; 


AGATATCCTT 


(SEQ 


ID 


NO ' 


3 22) ; 


ATATCCTTGA 


(SEQ 


ID 


NO 


324) • 


ATPPTTGATP 


\ ^ x_> w 


TD 

X LJ 


NO 

Xlt W 


1 S> 6 ^ • 

-J O / / 


CCTTGATCTG 


(SEQ 


ID 


NO 


.328) ; 


TTGATCTGTG 


(SEQ 


ID 


NO 


:330) ; 


GAT CTGTGGA 


(SEQ 


ID 


NO 


:332) ; 


TCTGTGGATC 


(SEQ 


ID 


NO 


:334) ; 


TGTGGATCTA 


(SEQ 


ID 


NO 


:336) ; 


TGGATCTACC 


(SEQ 


ID 


NO 


:338) ; 
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CCACTTTTTA 


(SEQ 


ID 


NO 


265) 


ACTTTTTAAA 


(SEQ 


ID 


NO 


267) 


TTTTTAAAAG 


(SEQ 


ID 


NO 


269) 


TTTAAAAGAA 


(SEQ 


ID 


NO 


271) 


TAAAAGAAAA 


(SEQ 


ID 


NO- 


273) 


AAAGAAAAGG 


(SEQ 


ID 


NO: 


275) 


AGAAAAGGGG 


(SEQ 


ID 


NO: 


277) 


AAAAGGGGGG 


(SEQ 


ID 


NO: 


279) 


AAGGGGGGAC 


(SEQ 


ID 


NO. 


281) 


GGGGGGACTG 


(SEQ 


ID 


NO: 


283) 


GGGGACTGGA 


(SEQ 


ID 


NO: 


285) 


GGACTGGAAG 


(SEO 


ID 


NO : 


287) 


A PTHG AAC5GG 

^T^^w- X UVJJWJVJVJ 


(SEQ 


ID 


NO : 


289) 




(SEQ 


ID 


NO : 


291) 


GAAGGGCTAA 


(SEO 


ID 


NO: 


293 ) 


AGGGCTAATT 


(SEQ 


ID 


NO : 


295) 


GGCTAATTCA 


(SEQ 


ID 


NO: 


297) 


CTAATTCACT 


(SEQ 


ID 


NO: 


299) 


AATTCACTCC 


(SEQ 


ID 


NO: 


301) 


TTCACTCCCA 


(SEQ 


ID 


NO: 


303) 




(SEO 


ID 


NO : 


305) 






TD 


NO - 






/ CT70 


T T-\ 

J. U 


l\\J : 






\ bby 


1U 


InU : 








± JJ 


: 


J ± J ) 






Tn 
X U 






AGACAAGATA 


(SEQ 


ID 


NO : 


317) 


ACAAGATATC 


(SEQ 


ID 


NO- 


319) 


AAGATATCCT 


(SEQ 


ID 


NO- 


321) 


GATATCCTTG 


(SEQ 


ID 


NO 


323) 


TATCCTTGAT 


(SEQ 


ID 


NO 


:325) 


TCCTTGATCT 


(SEQ 


ID 


NO 


:327) 


Ll 1 A 1 L Hjr 1 






"NTH 




TGATCTGTGG 


(SEQ 


ID 


NO 


:331) 


ATCTGTGGAT 


(SEQ 


ID 


NO 


:333) 


CTGTGGATCT 


(SEQ 


ID 


NO 


:335) 


GTGGATCTAC 


(SEQ 


ID 


NO 


:337) 


GGATCTACCA 


(SEQ 


ID 


NO 


:339) 



WO 95/21912 



PCT/AU95/00063 



- 218 - 



GATCTACCAC 


(SEQ 


ID 


NO : 


34 0) ; 


ATCTACCACA 


(SEQ 


ID 


NO 


341) ; 


TCTACCACAC 


(SEQ 


ID 


NO : 


342) ; 


CTACCACACA 


(SEQ 


ID 


NO 


3 4 3) ; 


TAC CACAC AC 


(SEQ 


ID 


NO : 


344) ; 


AC CACAC AC A 


/ T~T/**\ 

(SEQ 


ID 


NO : 


3 4 5} ; 


C C ACACACAA 


(SEQ 


ID 


NO : 


34 6) ; 


CACACACAAG 


(SEQ 


ID 


T\T/-\ 

NO : 


34 7) ; 


ACACACAAGG 


( SEQ 


ID 


NO : 


348) ; 


/~t TV ^"t TV /~1 TV TV /"T /~1 

CACACAAGGC 


(SEQ 


ID 


NO : 


3 4 9) ; 


ACACAAGGCT 


(SEQ 


ID 


NO : 


3 50) ; 


CACAAGGCTA 


(SEQ 


ID 


NO : 


3 51) ; 


ACAAGGCTAC 


(SEQ 


ID 


NO : 


3 52) ; 


CAAGGCTACT 


(SEQ 


ID 


NO : 


3 53) ; 


AAGGCTACTT 


(SEQ 


ID 


NO : 


3 54) ; 


AGGCTACTTC 


(SEQ 


ID 


NO : 


3 55) ; 


GGCTACTTCC 


(SEQ 


ID 


NO : 


3 56) ; 


GCTACTTCCC 


(SEQ 


ID 


NO : 


3 57) ; 


CTACTTCCCT 


(SEQ 


ID 


NO : 


3 58) ; 


TACTTCCCTG 


(SEQ 


ID 


NO : 


3 59) ; 


ACTTCCCTGA 


(SEQ 


ID 


NO: 


360) ; 


CTTCCCTGAT 


(SEQ 


ID 


NO : 


3 61) ; 


TTGCCTGATT 


(SEQ 


ID 


NO : 


3 62) ; 


TCCCTGATTG 


(SEQ 


ID 


NO: 


3 63) ; 


CCCTGATTGG 


(SEQ 


ID 


NO: 


364) ; 


CCTGATTGGC 


(SEQ 


ID 


NO: 


365) ; 


CTGATTGGCA 


(SEQ 


ID 


NO: 


366) ; 


TGATTGGCAG 


(SEQ 


ID 


NO: 


3 67) ; 


GATTGGCAGA 


(SEQ 


ID 


NO: 


3 68) ; 


ATTGGCAGAA 


(SEQ 


ID 


NO: 


369) ; 


TTGGCAGAAC 


(SEQ 


ID 


NO : 


3 70) ; 


TGGCAGAACT 


(SEQ 


ID 


NO: 


371) ; 


GGCAGAACTA 


(SEQ 


ID 


NO : 


3 72) ; 


GCAGAACTAC 


(SEQ 


ID 


NO: 


3 73) ; 


CAGAACTACA 


(SEQ 


ID 


NO : 


3 74) ; 


AGAACTACAC 


(SEQ 


ID 


NO: 


375) ; 


« TA t* m m ^* tj* tx 

GAAC TAC ACA 


(SEQ 


ID 


NO : 


3 76) ; 


AACTACACAC 


(SEQ 


ID 


NO: 


377) ; 


ACTACACACC 


(SEQ 


ID 


NO : 


3 78) ; 


CTACACACCA 


(SEQ 


ID 


NO: 


3 79) ; 


TACACACCAG 


(SEQ 


ID 


NO : 


3 80) ; 


ACACACCAGG 


(SEQ 


ID 


NO: 


3 81) ; 


CACACCAGGG 


(SEQ 


ID 


NO : 


3 82) ; 


ACACCAGGGC 


(SEQ 


ID 


NO : 


3 83) ; 


CACCAGGGCC 


(SEQ 


ID 


NO: 


3 84) ; 


ACCAGGGCCA 


(SEQ 


ID 


NO: 


3 85) ; 


CCAGGGCCAG 


(SEQ 


ID 


NO: 


3 86) ; 


CAGGGCCAGG 


(SEQ 


ID 


NO: 


387) ; 


AGGGCCAGGG 


(SEQ 


ID 


NO : 


388) ; 


GGGCCAGGGG 


(SEQ 


ID 


NO : 


3 89) ; 


GGCCAGGGGT 


(SEQ 


ID 


NO : 


3 90) ; 


GCCAGGGGTC 


(SEQ 


ID 


NO : 


3 91) ; 


CCAGGGGTCA 


(SEQ 


ID 


NO : 


3 92) ; 


CAGGGGTCAG 


(SEQ 


ID 


NO: 


3 93) ; 


AGGGGTCAGA 


(SEQ 


ID 


NO : 


3 94) ; 


GGGGTCAGAT 


(SEQ 


ID 


NO: 


3 95) ; 


GGGTCAGATA 


(SEQ 


ID 


NO : 


3 96) ; 


GGTCAGATAT 


(SEQ 


ID 


NO: 


397) ; 


GTCAGATATC 


(SEQ 


ID 


NO: 


3 9 8) ; 


TCAGATATCC 


(SEQ 


ID 


NO: 


399) ; 


CAGATATCCA 


(SEQ 


ID 


NO : 


4 00) ; 


AGATATCCAC 


(SEQ 


ID 


NO: 


401) ; 


GATATCCACT 


(SEQ 


ID 


NO : 


4 02) ; 


ATATCCACTG 


(SEQ 


ID 


NO : 


4 03) ; 


TATCCACTGA 


(SEQ 


ID 


NO : 


404 ) ; 


ATCCACTGAC 


(SEQ 


ID 


NO : 


4 05) ; 


1 CCACTGACC 


(SEQ 


ID 


NO : 


4 06) ; 


CCACTGACCT 


(SEQ 


ID 


NO : 


4 07) ; 


CACTGACCTT 


(SEQ 


ID 


NO : 


408) ; 


ACTGACCTTT 


(SEQ 


ID 


NO: 


4 09) ; 


CTGACCTTTG 


(SEQ 


ID 


NO: 


410) ; 


TGACCTTTGG 


(SEQ 


ID 


NO: 


411) ; 


GACCTTTGGA 


(SEQ 


ID 


NO: 


412) ; 


ACCTTTGGAT 


(SEQ 


ID 


NO: 


413) ; 


CCTTTGGATG 


(SEQ 


ID 


NO: 


414) ; 


CTTTGGATGG 


(SEQ 


ID 


NO: 


415) ; 
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TTTGGATGGT 


(SEQ 


ID 


NO 


:416) ; 


TTGGATGGTG 


(SEQ 


ID 


NO 


:417) ; 


TGGATGGTGC 


(SEQ 


ID 


NO 


:418) ; 


GGATGGTGCT 


(SEQ 


ID 


NO 


:419) ; 


GATGGTGCTA 


(SEQ 


ID 


NO 


:420) ; 


ATGGTGCTAC 


(SEQ 


ID 


NO 


:421) ; 


TGGTGCTACA 


(SEQ 


ID 


NO 


: 422) ; 


GGTGCTACAA 


(SEQ 


ID 


NO 


:423) ; 


GTGCTACAAG 


(SEQ 


ID 


NO 


:424) ; 


TGCTACAAGC 


(SEQ 


ID 


NO 


:425) ; 


GCTACAAGCT 


(SEQ 


ID 


NO 


: 426) ; 


CTACAAGCTA 


(SEQ 


ID 


NO 


:427) ; 


TACAAGCTAG 


(SEQ 


ID 


NO 


:428) ; 


ACAAGCTAGT 


(SEQ 


ID 


NO 


:429) ; 


CAAGCTAGTA 


(SEQ 


ID 


NO 


:430) ; 


AAGCTAGTAC 


(SEQ 


ID 


NO 


:431) ; 


AGCTAGTACC 


(SEQ 


ID 


NO 


:432) ; 


GCTAGTACCA 


(SEQ 


ID 


NO 


:433) ; 


CTAGTAC CAG 


(SEQ 


ID 


NO 


:434) ; 


TAGTACCAGT 


(SEQ 


ID 


NO 


:435) ; 


AGTACCAGTT 


(SEQ 


ID 


NO 


:436) ; 


GTACCAGTTG 


(SEQ 


ID 


NO 


:437) ; 


TACCAGTTGA 


(SEQ 


ID 


NO 


:438) ; 


ACCAGTTGAG 


(SEQ 


ID 


NO 


:439) ; 


CCAGTTGAGC 


(SEQ 


ID 


NO 


:440) ; 


CAGTTGAGCC 


(SEQ 


ID 


NO 


:441) ; 


AGTTGAGCCA 


(SEO 


ID 


NO 


: 442 ) ; 


GTTGAGCCAG 


(SEO 


ID 


NO 


:443) ; 


TTGAGCCAGA 


(SEQ 


ID 


NO 


:444) ; 


TGAGCCAGAT 


(SEO 


ID 


NO 


:445) ; 


GAGCCAGATA 


(SEQ 


ID 


NO 


: 446 ) ; 


AGCCAGATAA 


(SEQ 


ID 


NO 


:447) ; 


GCCAGATAAG 


(SEQ 


ID 


NO 


: 448) ; 


CCAGATAAGG 


(SEQ 


ID 


NO 


:449) ; 


CAGATAAGGT 


(SEO 


ID 


NO 


: 4 5 0 ) ; 


AGATAAGGTA 


(SEQ 


ID 


NO 


:451) ; 








NO 


• Tt J ^ / / 


ATAAGGTAGA 


(SEO 


ID 


NO 


: 4 5 3 ) ; 






TTi 




: ft Ort / , 






_L LJ 


IN w 


. *± D D / , 


Abu 1 AoAAbA 




±u 




: ft jo j ; 










. *± o / y / 






x, U 




.100/ / 




( SFO 


in 


NO 


* '-t ~-> y } f 


AGAAG AGG C C 


( SEO 


ID 


NO 


: 4 6 0 ) ; 


GAAGAGGCCA 


(SEQ 


ID- 


NO 


:461) ; 


AAGAGGC CAA 


(SEQ 


ID 


NO 


:462) ; 


AGAGGCCAAT 


(SEQ 


ID 


NO 


:463) ; 


GAGGCCAATA 


(SEQ 


ID 


NO 


:464) ; 


AGGCCAATAA 


(SEQ 


ID 


NO 


:465) ; 


GGCCAATAAA 


(SEQ 


ID 


NO 


:466) ; 


GCCAATAAAG 


(SEQ 


ID 


NO 


:467) ; 


CCAATAAAGG 


(SEQ 


ID 


NO 


:468) ; 


CAATAAAGGA 


(SEQ 


ID 


NO 


:469) ; 


AATAAAGGAG 


(SEQ 


ID 


NO 


:470) ; 


ATAAAGGAGA 


(SEQ 


ID 


NO 


:471) ; 


TAAAGGAGAG 


(SEO 


ID 


NO 


: 4 72) ; 


AAAGGAGAGA 


(SEQ 


ID 


NO 


:473) ; 


AAGGAGAGAA 


(SEQ 


ID 


NO 


:474) ; 


AGGAGAGAAC 


(SEQ 


ID 


NO 


:475) ; 


GGAGAGAACA 


(SEQ 


ID 


NO 


:476) ; 


GAGAGAACAC 


(SEQ 


ID 


NO 


:477) ; 


AGAGAACACC 


(SEQ 


ID 


NO 


:478) ; 


GAGAACACCA 


(SEQ 


ID 


NO 


:479) ; 


a n a a r*A r*r* a n 




Tr» 
X u 




. *± O \J J f 


dAAPAPCAGC 


( SEO 

\ i—i 


ID 

X XV 


NO 


• 4 ft 1 ^ * 


AACACCAGCT 


(SEQ 


ID 


NO 


:482) ; 


ACACCAGCTT 


(SEQ 


ID 


NO 


:483) ; 


CACCAGCTTG 


(SEQ 


ID 


NO 


:484) ; 


ACCAGCTTGT 


(SEQ 


ID 


NO 


:485) ; 


CCAGCTTGTT 


(SEQ 


ID 


NO 


:486) ; 


CAGCTTGTTA 


(SEQ 


ID 


NO 


:487) ; 


AGCTTGTTAC 


(SEQ 


ID 


NO 


:488) ; 


GCTTGTTACA 


(SEQ 


ID 


NO 


:489) ; 


CTTGTTACAC 


(SEQ 


ID 


NO 


:490) ; 


TTGTTACACC 


(SEQ 


ID 


NO 


:491) ; 
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TGTTACACCC (SEQ ID NO: 4 92) 
TTAGACCCTG (SEQ ID NO: 4 94) 
ACACCCTGTG (SEQ ID NO: 4 96) 
ACCCTGTGAG (SEQ ID NO: 4 98) 
CCTGTGAGCC (SEQ ID NO: 500) 
TGTGAGC CTG (SEQ ID NO: 502) 
TGAGCCTGCA (SEQ ID NO: 5 04) 
AGCCTGCATG (SEQ ID NO: 50 6) 
CCTGCATGGA (SEQ ID NO: 50 8) 
TGCATGGAAT (SEQ ID NO: 510) 
CATGGAATGG (SEQ ID NO: 512) 
TGGAATGGAT (SEQ ID NO: 514) 
GAATGGATGA (SEQ ID NO: 516) 
ATGGATGACC (SEQ ID NO: 518) 
GGATGACCCT (SEQ ID NO: 520) 
ATGACCCTGA (SEQ ID NO: 522) 
GACCCTGAGA (SEQ ID NO: 524) 
CCCTGAGAGA (SEQ ID NO: 526) 
CTGAGAGAGA (SEQ ID NO: 528) 
GAGAGAGAAG (SEQ ID NO: 53 0) 
GAGAGAAGTG (SEQ ID NO: 532) 
GAGAAGTGTT (SEQ ID NO: 534) 
GAAGTGTTAG (SEQ ID NO:536) 

AGTGTTAGAG ( SEQ ID NO : 5 3 8 ) 

TGTTAGAGTG (SEQ ID NO: 540), 

TTAGAGTGGA (SEQ ID NO: 542), 

AGAGTGGAGG (SEQ ID NO: 544), 

AGTGGAGGTT- (SEQ ID NO: 546); 

TGGAGGTTTG ( SEQ ID NO : 54 8 ) ; 

GAGGTTTGAC ( SEQ ID NO : 5 5 0 ) ; 

GGTTTGACAG (SEQ ID NO: 552) ; 

TTTGACAGCC (SEQ ID NO:554); 

TGACAGCCGC (SEQ ID NO: 556); 

ACAGCCGCCT (SEQ ID NO: 5 58); 

AGCCGCCTAG (SEQ ID NO: 5 60); 

CCGCCTAGCA (SEQ ID NO:562); 

GCCTAGCATT (SEQ ID NO: 5 64); 

CTAGCATTTC (SEQ ID NO:566); 
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GTTACACCCT 


(SEQ 


ID 


NO 


: 4 9 3 ) ; 


1 ACACCCTGT 


(SEQ 


ID 


NO 


: 4 9 5 ) ; 


CACCCTGTGA 


(SEQ 


ID 


NO 


: 4 97) ; 


CCCTGTGAGC 


(SEQ 


ID 


NO 


: 4 9 9 ) ; 


CTGTGAGCCT 


(SEQ 


ID 


NO 


: 501) ; 


GTGAGCCTGC 


(SEQ 


ID 


NO 


:503) ; 


GAGCCTGCAT 


(SEQ 


ID 


NO 


:505) ; 


GCCTGCATGG 


(SEQ 


ID 


NO 


:507) ; 


CTGCATGGAA 


(SEQ 


ID 


NO 


:509) ; 


GCATGGAATG 


(SEQ 


ID 


NO 


:511) ; 


ATGGAATGGA 


(SEQ 


ID 


NO 


:513 ) ; 


GGAATGGATG 


(SEQ 


ID 


NO 


.515) ; 


AATGGATGAC 


(SEQ 


ID 


NO 


517) / 


TGGATGACCC 


(SEQ 


ID 


NO: 


519) ; 


G ATG AC C CTG 


(SEQ 


ID 


NO : 


521) ; 


TGACCCTGAG 


(SEQ 


ID 


NO: 


523) ; 


ACCCTGAGAG 


(SEQ 


ID 


NO: 


525) ; 


CCTGAGAGAG 


(SEQ 


ID 


NO: 


527) ; 


TGAGAGAGAA 


(SEQ 


ID 


NO: 


52 9) ; 


AGAGAGAAGT 


(SEQ 


ID 


NO: 


531) ; 


AGAGAAGTGT 


(SEQ 


ID 


NO: 


533) ; 


AG AAGTGT T A 


(SEQ 


ID 


NO: 


535) ; 


AAGTGTTAGA 


(SEQ 


ID 


NO: 


537) ; 


GTGTTAGAGT 


(SEQ 


ID 


NO: 


539) ; 


GTTAGAGTGG 


(SEQ 


ID 


NO: 


541) ; 


TAGAGTGGAG 


(SEQ 


ID 


NO: 


543) ; 


GAGTGGAGGT 


(SEQ 


ID 


NO: 


545) ; 


GTGGAGGTTT 


(SEQ 


ID 


NO: 


547) ; 


GGAGGTTTGA 


(SEQ 


ID 


NO: 


54 9) / 


AGGTTTGACA 


(SEQ 


ID 


NO: 


551) ; 


GTTTGACAGC 


(SEQ 


ID 


NO: 


553) ; 


TTGACAGCCG 


(SEQ 


ID 


NO: 


555) ; 


GACAGCCGCC 


(SEQ 


ID 


NO : 


557) ; 


CAGC CG CCT A 


(SEQ 


ID 


NO : 


559) ; 


GCCGCCTAGC 


(SEQ 


ID 


NO: 


561) ; 


CGCCTAGCAT 


(SEQ 


ID 


NO: 


563) ; 


CCTAGCATTT 


(SEQ 


ID 


NO: 


565) ; 


TAGCATTTCA 


(SEQ 


ID 


NO: 


567) ; 
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AGCATTTCAT 


(SEQ 


ID 


NO 


:568) ; 


GCATTTCATC 


(SEQ 


ID 


NO 


: 5 6 9 ) / 




CATTTCATCA 


(SEQ 


ID 


NO 


:570) ; 


ATTTCATCAC 


(SEQ 


ID 


NO 


: 5 7 1 ) ; 




TTTCATCACG 


(SEQ 


ID 


NO 


:572) ; 


TTCATCACGT 


(SEQ 


ID 


NO 


: 573) ; 




TCATCACGTG 


(SEQ 


ID 


NO 


:574) ; 


CATCACGTGG 


(SEQ 


ID 


NO 


: 575) ; 




ATCACGTGGC 


(SEQ 


ID 


NO 


:576) ; 


TCACGTGGCC 


(SEQ 


ID 


NO 


: 577) ; 




CACGTGGCCC 


(SEQ 


ID 


NO 


:578) ; 


ACGTGGCCCG 


(SEQ 


ID 


NO 


: 5 7 9 ) / 




CGTGGCCCGA 


(SEQ 


ID 


NO 


: 58 0) ; 


GTGGCCCGAG 


(SEQ 


ID 


NO 


:581) ; 




TGGCCCGAGA 


(SEQ 


ID 


NO 


:582) ; 


GGCCCGAGAG 


(SEQ 


ID 


NO 


:583) ; 




GCCCGAGAGC 


(SEQ 


ID 


NO 


:584) ; 


CCCGAGAGCT 


(SEQ 


ID 


NO 


:585) ; 




CCGAGAGCTG 


(SEQ 


ID 


NO 


:586) ; 


CGAGAGCTGC 


(SEQ 


ID 


NO 


: 5 8 7 ) ; 




GAGAGCTGCA 


(SEQ 


ID 


NO 


:588) ; 


AGAGCTGCAT 


(SEQ 


ID 


NO 


: 5 8 9 ) ; 




GAGCTGCATC 


(SEQ 


ID 


NO 


:590) ; 


AGCTGCATCC 


(SEO 


ID 


NO 

A 1 ! KS 


M J J J , 




GCTGCATCCG 


(SEQ 


ID 


NO: 


592) ; 


CTGCATOCGG 


V O Hi V 


TD 


MO 


► C Q o \ . 

O z) J ; ; 




TGCATCCGGA 


(SEQ 


ID 


NO: 


594) ; 






TP) 


vi\j : 


C QC \ . 

->yo| ; 




CATCCGGAGT 


(SEQ 


ID 


NO: 


596) ; 


ATCCGGAGTA 


(SEO 


TD 




/ ) , 




TCCGGAGTAC 


(SEQ 


ID 


NO: 


598) ; 


CCGGAGTACT 


(SEO 


ID 


NO • 


-) j zj ) f 


* * % ■ 


CGGAGTACTT 


(SEQ 


ID 


NO: 


600) ; 


GGAGTACTTC 


(SEQ 


ID 


NO: 


601) ; 




GAGTACTTCA 


(SEQ 


ID 


NO: 


602) ; 


AGTACTTCAA 


(SEQ 


ID 


NO: 


603 ) ; 




GTACTTCAAG 


(SEQ 


ID 


NO: 


604) ; 


TACTTCAAGA 


(SEQ 


ID 


NO: 


605); 




ACTTCAAGAA 


(SEQ 


ID 


NO: 


606) ; 


CTTCAAGAAC 


(SEQ 


ID 


NO: 


607) ; 




TTCAAGAACT 


(SEQ 


ID 


NO: 


608) ; 


TCAAGAACTG 


(SEQ 


ID 


NO: 


609) ; 




CAAGAACTGC 


(SEQ 


ID 


NO: 


610) ; 


AAGAACTGCT 


(SEQ 


ID 


NO: 


611) ; 




AGAACTGCTG 


(SEQ 


ID 


NO: 


612) ; 


GAACTGCTGA 


(SEQ 


ID 


NO: 


613) . 





49. A method according to claim 44 wherein said deletion encompasses at least one 
of the following decanucleotides from the LTR region of fflV-1 strain NL4-3: 



GCTTTTTGCC 


(SEQ 


ID 


NO: 


652) ; 


CTTTTTGCCT 


(SEQ 


ID 


NO: 


653) ; 


TTTTTGCCTG 


(SEQ 


ID 


NO: 


654) / 


TTTTGCCTGT 


(SEQ 


ID 


NO: 


655) ; 


TTTGCCTGTA 


(SEQ 


ID 


NO: 


656) ; 


TTGCCTGTAC 


(SEQ 


ID 


NO: 


657) ; 


TGCCTGTACT 


(SEQ 


ID 


NO: 


658) ; 


GCCTGTACTG 


(SEQ 


ID 


NO: 


659) ; 


CCTGTACTGG 


(SEQ 


ID 


NO: 


660) ; 


CTGTACTGGG 


(SEQ 


ID 


NO: 


661) ; 


TGTACTGGGT 


(SEQ 


ID 


NO: 


662) ; 


GTACTGGGTC 


(SEQ 


ID 


NO: 


663) ; 


TACTGGGTCT 


(SEQ 


ID 


NO: 


664) ; 


ACTGGGTCTC 


(SEQ 


ID 


NO: 


665) ; 


CTGGGTCTCT 


(SEQ 


ID 


NO: 


666) ; 


TGGGTCTCTC 


(SEQ 


ID 


NO: 


667) ; 


GGGTCTCTCT 


(SEQ 


ID 


NO: 


668) ; 


GGTCTCTCTG 


(SEQ 


ID 


NO: 


669) ; 


GTCTCTCTGG 


(SEQ 


ID 


NO: 


670) ; 


TCTCTCTGGT 


(SEQ 


ID 


NO: 


671) ; 


CTCTCTGGTT 


(SEQ 


ID 


NO: 


672) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


673) ; 
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CTCTGGTTAG 


( SEQ 


ID 


NO 


: 6 74 y 


CTGGTTAGAC 


(SEQ 


ID 


NO 


. /T *~7 P" \ 


GGTTAGACCA 


(SEQ 


ID 


NO 


: 678 ) 


TT AGAC C AG A 


(SEQ 


ID 


NO 


: 680 ) 


AGACCAGATC 


(SEQ 


ID 


NO 


: 682 ) 


ACCAGATCTG 


(SEQ 


ID 


NO 


: 684 ) 


C AG AT C TGAG 


( SEQ 


ID 


NO 


: 686 ) 


/~t 7\ rp v**irp/~i 7\ **i /*-» /-i 

■oA 1 <w 1 (jAGL l. 


( SEQ 


ID 


NO 


c o o \ 

' 688 ; 


I CI CjALjCC 1 Li 


( SEQ 


ID 


NO 


• 6 90 ) 


TGAGCCTGGG 


(SEQ 


ID 


NO 


: 6 92 ) 


AGCCTGGGAG 


(SEQ 


ID 


NO 


: 694) 


CCTGGGAGCT 


(SEQ 


ID 


NO 


: 696 ) 


TGGGAGCTCT 


(SEQ 


ID 


NO 


: 698) 


GGAGCTCTCT 


(SEQ 


ID 


NO 


700) 


AGCTCTCTGG 


(SEQ 


ID 


NO ' 


1 702 ) 


CTCTCTGGCT 


(SEQ 


ID 


NO 


704) 


CTCTGGCTAA 


(SEQ 


ID 


NO: 


706) 


CTGGCTAACT 


(SEQ 


ID 


NO: 


708) 


GGCTAACTAG 


(SEQ 


ID 


NO: 


710) 


CTAACTAGGG 


(SEQ 


ID 


NO : 


712) 


AACTAGGGAA 


(SEQ 


ID 


NO : 


714 ) 


CTAGGGAACC 


(SEQ 


ID 


NO : 


716 ) 


AGGGAACCCA 


(SEQ 


ID 


NO : 


718 ) 


GGAACCCACT 


(SEQ 


ID 


NO : 


720) 


"ft "ft /**! /""I 7« flmrtri 

AACCCACTGC 


(SEQ 


ID 


NO : 


722 ) 


CCCACTGCTT 


(SEQ 


ID 


NO : 


724 ) 


CACTG CTTAA 


(SEQ 


ID 


NO : 


726 ) 


CTGCTTAAGC 


(SEQ 


ID 


NO : 


728 ; 


GCTTAAGCCT 


(SEQ 


ID 


NO : 


73 0 ) 


mm t\ tv <^/^om/^iv 

TTAAGCCTCA 


(SEQ 


ID 


NO : 


732 ) 


iv tv ^t^n r*t tv tv m 

AAGCCTCAAT 


(SEQ 


ID 


NO : 


734 ) 


GCCTCAATAA 


(SEQ 


ID 


NO : 


736) 


CTCAATAAAG 


(SEQ 


ID 


NO : 


738) 


TV TV m TV TV TV s~i m 

CAATAAAGCT 


(SEQ 


ID 


NO : 


740 ) 


ATAAAGCTTG 


(SEQ 


ID 


NO: 


742) 


AAAGCTTGCC 


(SEQ 


ID 


NO: 


744) 


AGCTTGCCTT 


(SEQ 


ID 


NO: 


746) 


CTTGCCTTGA 


(SEQ 


ID 


NO: 


748) 



222 - 



r pr^ r Pf~ ,r T > nr2. r v r T t & 

1L1L1 1 iH 




TD 




6 7^ ) - 


tppttzipzi pp 




in 


NO • 


fill) • 




/ CUP) 


TFi 




£ 1 Q ) • 


J. Avj AU L-AUi-V X 


/ OTTO 


X JJ 


TvTP « 


boxy , 


PAPPAP A 'PPT' 






TvTpA . 
JNU . 


DO j ) , 


ppapa nrr^TT 1 a 

UUA l oi-V ILi vjM. 


( QT?P 


T Pi 




OO Dj / 


iiuii 1L1 U 




x u 




£ 9, 1 \ * 


M. JL ^- 1 Lji-iVj^w \— J. 






MP - 
J-iU • 


v O _? / , 


PTP A P PPTPP 




TFi 
X u 


"KTO • 


d y x / / 


P^PPPTPPPA 




TPl 

X u 


MP • 

IN v_/ . 


COT 1 . 




( CT?P 


TPl 
XL) 




O J J J , 






XiJ 


mp . 


C Q *7 A . 

by/; ; 


/-ip^pi 7\ /~i p^n~'/^ l 'T , p^ 




T Ft 
XU 


MP . 

L\u : 


P Q Q \ . 


GAGCTCTCTG 


/ O T~1 P\ 

(SEQ 


T Fi 
XJJ 


JNU : 


/Ol; ; 


/^rp rtrn /™*rn /-'i /~i 

GCTCTCTGGC 


(SEQ 


T Fl 

XU 


"KT/^i . 

WU : 


*~7 Pi *5 \ . 

/Ui) ; 


TCTCTGGCTA 


(SEQ 


ID 


NO : 


t r\ c* \ 

7 0 5) / 


TCTGGCTAAC 


(SEQ 


ID 


NO : 


rift o \ 

7 0 7) ; 


m /~ * /~ i m tv. tv no TV 

TGGCTAACTA 


(SEQ 


ID 


NO : 


nf p* pi \ 

7 0 9) ; 


GCTAACTAGG 


(SEQ 


ID 


NO : 


711) ; 


TAACTAGLjCA 


(SEQ 


T Pi 

ID 


NO : 


713 ; ; 


AC J. AtjCrLxAAC 




TFi 
XU 


JNU : 


/ x b ; ; 


TAPPPA A P^P^P 1 


(o£jU 


TFi 
XU 


JNU : 


/ 1 / ; ; 


VjKjVj AAL L. UAL 


/ CT7P 


T Pi 
XU 


JNU : 


/iy j ; 






TFi 
X D 


T\TPi . 

rsiu : 


HOI \ . 

/ ^ X ; ; 




/ cup 


TFi 
XU 


jnu : 


/ z j ; ; 


ppT, pr-pz-i p^TT 1 7A 




TFi 
XU 


jnu : 


/ ^ b ; ; 


tv PTPPTTS TAP 


/ QUO 


TFi 
XU 


iNU : 






/ OTTO 


T Pi 
XU 


MP . 

jnu : 


77Q\ . 

/ Zy ) ; 


pTfp a tv n PPTP 




TFi 
X U 


MP . 

jnu : 


/ j X ; ; 






TFi 
XU 


MP . 

jnu : 


7T3 \ . 

/ j J J ; 


7A P PPHTP 7A A T* 7A 




TPi 
X D 


MP . 

jnu : 


7*3 c \ . 


P^P^TP* A A T* A A A 
1 LAA1AAA 




T Pi 
XU 


MP** . 

INU : 


7-)T\ . 

/ J / ; ; 


TP A ATA A A PP 


( CT?P 


TFi 
XU 


MP\ . 

INU : 


/ o y ; ; 


A ATA A APPTT 




TFi 
XU 


MP - 

jnu : 


Ty-I t \ . 

J ; 


TAAAGCTTGC 


(SEQ 


ID 


NO : 


74 3) ; 


AAGCTTGCCT 


(SEQ 


ID 


NO: 


745) ; 


GCTTGCCTTG 


(SEQ 


ID 


NO: 


747) ; 


TTGCCTTGAG 


(SEQ 


ID 


NO: 


749) ; 
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lbrCU J. 1 VjAbiX 


( oyn 


xJJ 




/ O U } f 


HP'PTTGAGTG 


f SEO 


ID 


NO : 


751) * 


p/imrnp 7, /-trp/-i /-1 

LL1 1 GACj 1 CjU 




Tn 

±U 


iNb/ . 




\^ X X \JiT.VJ X VJJ ^ x 


(SEO 

\ 0 •'-•V 


ID 


NO : 


753 ) ; 


rprp/-i tv /""T 1 /*" 1 /™"'! "P 




X U 


JNb> . 


I^A \ ' 


X uriu X wv- X X\_- 


(SEO 


ID 


NO : 


755) ; 


CjACj I Qj(~ 1 1 UA 




JL jj 


INb' . 


/DO j / 




(SEO 


ID 


NO : 


757) ; 


G I CjC 1 I CAAG 


/ O TO 

(bby 


jn 
1U 


JNU : 


/DO/ , 


X X X \~r\r\\3 X 


( SEO 


ID 


NO : 


759) - 


GC 1 1 L AAG 1 A 


(bli(J 


tn 
1JJ 


ATO - 

jnu : 




X X \^r\r\\J X rtO 


(SEO 


ID 


NO : 


761) ; 


1 1 LAAb I Abi 1 




xJJ 


XT Pi - 




TP A AGTAGTG 


(SEO 


ID 


NO : 


763) ; 






-L xy 


mo • 


764 } - 


AAGTAGTGTG 


(SEQ 


ID 


NO: 


765) ; 


AGTAOTGTGT 


(SEO 


ID 


NO : 


766 ) ; 


GTAGTGTGTG 


(SEQ 


ID 


NO: 


767) ; 


TAGTGTGTGC 


(SEO 


ID 


NO : 


76 8) ; 


AGTGTGTGCC 


(SEQ 


ID 


NO: 


769) ; 


OTGTOTGPPP 


(SEO 


ID 


NO : 


770) ; 


TGTGTGCCCG 


(SEQ 


ID 


NO: 


771) ; 


(TTGTGPPPOT 


( sfo 


TD 


NO - 


77? ) • 


TGTGCCCGTC 


(SEQ 


ID 


NO: 


773) ; 






TVi 


"MO * 


11 a \ • 


T^PPPGTPTG 


(SEO 


ID 


NO : 


775) ; 


GCCCG1 CTGx 


(SEQ 


ID 


JNU : 


*~I H C \ . 

/ / 0 ) ; 


nrrpTrTPTT 

LLLbl 1 \j 1 1 




X u 


NO • 


777\ . 

fit!/ 


CCGTCTGTTG 


(SEQ 


ID 


JNU : 


/to) ; 






x xy 


NO - 


119) - 


GTCTGTTGTG 


(SEQ 


ID 


NO : 


T O A \ . 

7 0 0 ) ; 


r ?r ,r pr ir r r rr ,r pn r r 
ILIbl XvjXLal 




TD 

X LJ 


MO • 


7 ft 1 ) • 

/ O X J f 


CTGTTGTGTG 


(SEQ 


ID 


NO : 


78/) ; 


rT»/~t np r P/^ i T 1 ^ 1 H" 1 /^"' A 

XvjX Ivalvjl 




th 

X LJ 


MO • 


7ft^ ) - 
/ 0 j j / 


GTTGTGTGAC 


(SEQ 


ID 


•»TA 

NO : 


7 84) ; 


I Ibibl bAb 1 




Tn 
x u 




/ O } / 


TGTGTGACTC 


(SEQ 


ID 


NO : 


786) ; 


1 1 bx AC 1L1 




Tn 




1R1) • 

/Of// 


TGTGTGACTC 


(SEQ 


ID 


NO : 


788) ; 


GTGTGACTCT 


(SEQ 


ID 


NO : 


*~l 0 f\ \ 

78 9) ; 


TGTGACTCTG 


(SEQ 


ID 


NO: 


790) ; 


GTGACTCTGG 


(SEQ 


ID 


NO: 


791) ; 


TGACTCTGGT 


(SEQ 


ID 


NO: 


792) ; 


GACTCTGGTA 


(SEQ 


ID 


NO: 


793) ; 


ACTCTGGTAA 


(SEQ 


ID 


NO: 


794) ; 


CTCTGGTAAC 


(SEQ 


ID 


NO: 


795) ; 


TCTGGTAACT 


(SEQ 


ID 


NO: 


796) ; 


CTGGTAACTA 


(SEQ 


ID 


NO: 


797) ; 


TGGTAACTAG 


(SEQ 


ID 


NO: 


798) ; 


GGTAACTAGA 


(SEQ 


ID 


NO: 


799) . 



50. A method according to claim 24 wherein said HIV-1 isolate has the identifying 
characteristics of HIV-1 isolate ECACC V94101706 or HIV-1 isolate ECACC 
V94 1031 169 or isolate C54 deposited at ECACC on 14 February, 1995 under Accession 
Number 



51. A method according to claim 20 wherein the HIV-1 isolate is reactive to 
antibodies to a glycoprotein of HIV-1, is capable of inducing an immune response of 
gag, pol and/or env and which is incapable of directing synthesis of a nef gene product 
or a full length nef gene product. 
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52. A method according to claim 51 wherein the glycoprotein is at least one of gp41- 
45, gpl20 and/or gpl60. 

53. A method for obtaining a preparation of non-pathogenic HIV- 1 from a biological 
sample, said method comprising co-culturing PBMCs from said biological sample from 
an individual putatively infected with said non-pathogenic HIV-1 with HIV-1 
seronegative donor PBMCs depleted for CD8+ cells, harvesting the PBMCs and 
supernatant fluid every from about 5 to about 10 days and adding fresh medium with 
CD8+ depleted PBMCs with said fresh medium and isolating said virus from the 
supernatant fluid. 



54. A method according to claim 53 wherein the seronegative PBMCs are pretreated 
with a cytokine for a time and under conditions sufficient to enhance HIV-1 replication. 

55. A method according to claim 54 wherein the cytokine is M-CSR 

56. A method according to claim 53 or 54 or 55 wherein the HIV-1 seronegative 
donor PBMCs are first stimulated with PHA and/or IL-2. 



57. A method according to claim 53 wherein the PBMCs from a putative infected 
individual are also CD8+ depleted prior to coculturing with seronegative donor PBMCs. 

58. A method according to claim 53 or 57 wherein the cocultured cells are subjected 
to UV irradiation. 



59. A method for obtaining a preparation of non-pathogenic HIV- 1 from a biological 
sample, said method comprising co-culturing monocytes from said biological sample 
from an individual putatively infected with said non-pathogenic HIV-1 with HIV-1 
seronegative donor PBMCs depleted for CD8+ cells, harvesting the monocytes and 
PBMCs and supernatant fluid every from about 5 to about 10 days and adding fresh 
medium with CD8+ depleted PBMCs with said fresh medium and isolating said virus 
from the supernatant fluid. 
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60/ A method according to claim 59 wherein the seronegative PBMCs are pretreated 
with a cytokine for a time and under conditions sufficient to enhance HIV-1 replication. 

61. A method according to claim 60 wherein the cytokine is M-CSF. 

62. A method according to claim 59 or 60 or 61 wherein the HIV-1 seronegative 
donor PBMCs are first stimulated with PHA and/or IL-2. 



63. A method according to claim 59 wherein the monocytes from a putative infected 
individual are also CD8+ depleted prior to coculturing with seronegative donor PBMCs. 

64. A method according to claim 59 or 63 wherein the cocultured cells are subjected 
to UV irradiation. 

65. A method for identifying or screening for compounds capable of reducing or 
otherwise interfering with HIV-1 replication, said method comprising contacting a 
compound to be tested with a cell or cell extract containing or capable of containing a 
nef gene product fused to a reporter molecule capable of giving an identifiable signal 
and screening for a compound which inhibits said signal. 

66. A method according to claim 65 wherein the compound is an antibody to nef 
gene product or a part thereof. 

67. A method according to claim 65 wherein the compound is a DNA targeting agent 
and inhibits transcription of the ra/gene. 

68. A compound capable of inhibiting nef gene product function or nef gene 
transcription. 

69. A compound capable of inhibiting nef gene product function or nef gene 
transcription identified according to the method of claim 65 or 66 or 67. 
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70. A therapeutic composition useful for inhibiting or reducing productive infection 
by a pathogenic strain of HIV-1 and/or for vaccinating an individual against the 
development of AIDS or AIDS-related diseases, said composition comprising a non- 
pathogenic strain of HIV-1 according to any one of claims 1 to 16 and one or more 
pharmaceutical acceptable carriers and/or diluents. 

71. A therapeutic composition according to claim 70 wherein said HIV-1 isolate 
comprises genetic material that directs expression of antisense or ribozyme nucleotide 
sequences which inhibit production of one or more proteins encoded by a pathogenic 
strain of fflV-L 

72. A viral isolate which: 

0) is genetically or immunologically related to a pathogenic strain of HIV- 1 ; 

00 is substantially non-pathogenic in human subjects; 

0*0 comprises a first nucleotide sequence constituting its genome which is 



of a non-protein encoding a region of a pathogenic strain of HIV-1. 
A viral isolate according to claim 72 wherein the targeted protein or polypeptide 



74, A viral isolate according to claim 72 wherein the targeted protein or polypeptide 
is selected from the list consisting of env or tat. 



capable of hybridising under medium stringency conditions to SEQ ID 
NO: 1 or a complementary form thereof; and 



(iv) 



comprises a second nucleotide sequence within said first nucleotide 
sequence and which second nucleotide sequence directs expression of a 
mRNA molecule capable of inhibiting, reducing or otherwise down- 
regulating translation of a protein or polypeptide encoded by a pathogenic 
strain of HIV-1 or inhibit, reduce or otherwise down regulate operation 



is selected from the list consisting of gag or pol 



75. A viral isolate according to claim 72 wherein the targeted protein or polypeptide 
is selected from the list consisting of rev or vpu. 
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76. A viral isolate according to claim 72 wherein the targeted protein or polypeptide 
is selected from the list consisting of vpr, vif or nef. 

77. A viral isolate according to claim 72 wherein the targeted protein is nef. 

78. A method for determining the pathogenicity of an HIV-1 strain after said HIV-1 
strain infects cells of an individual, said method comprising determining the presence 
of a deletion mutation in the genome of said HIV-1 wherein said deletion mutation 
results in said genome being unable to synthesise a polypeptide or protein from a 
pathogenic strain of HIV-1 or directing the synthesis of a truncated form of said 
polypeptide or protein wherein the presence of a such a mutation is indicative of the 
presence of a non-pathogenic strain of HIV-1. 

79. The method according to claim 78 wherein said non-pathogenic HIV-1 carries 
a deletion in its genome of at least 10 nucleotides within the region from nucleotide 
8787 to nucleotide 9709 using the nucleotide numbering of HIV-1 strain NL4-3. 

80. A strain of HIV-1 according to claim 79 wherein said non-pathogenic HIV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of: 

nucleotide (i) 8830-8862; 



81. A strain of HIV-1 according to claim 79 wherein said non-pathogenic HIV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of: 



(ii) 
(iii) 



9009-9035; 



9019-9029; and 



(iv) 



9033-9049. 



nucleotide 



(v) 
(vi) 



9281-9371 



9281-9362 



(vii) 
(viii) 



9105-9224 



and 



9271-9370. 
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82. A strain of HIV-1 according to claim 79 wherein said non-pathogenic HIV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of: 
nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 



83. A strain of HIV-1 according to claim 79 wherein said non-pathogenic HIV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of: 
nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 



A strain of HIV-1 according to claim 79 wherein said deletion encompasses at 



least one of the 


following decanucleotides from the nef gene of HIV-1 strain NL4-3: 


ATGGGTGGCA 


(SEQ 


ID 


NO 


:2) ; 


TGGGTGGCAA 


(SEQ ID 


NO 


:3) ; 


GGGTGGCAAG 


(SEQ 


ID 


NO 


:4) ; 


GGTGGCAAGT 


(SEQ ID 


NO 


:5) ; 


GTGGCAAGTG 


(SEQ 


ID 


NO 


:6) ; 


TGGCAAGTGG 


(SEQ 


ID 


NO: 7) 


GGCAAGTGGT 


(SEQ 


ID 


NO 


:8) ; 


GCAAGTGGTC 


(SEQ ID 


NO 


:9) ; 


CAAGTGGTCA 


(SEQ 


ID 


NO 


:10) ; 


AAGTGGTCAA 


(SEQ 


ID 


NO: 11) 


AGTGGTCAAA 


(SEQ 


ID 


NO 


:12) ; 


GTGGTCAAAA 


(SEQ 


ID 


NO : 1 3 ) 


TGGTCAAAAA 


(SEQ 


ID 


NO 


= 14) ; 


GGTCAAAAAG 


(SEQ 


ID 


NO: 15) 


GTCAAAAAGT 


(SEQ 


ID 


NO 


:16) ; 


TCAAAAAGTA 


(SEQ 


ID 


NO: 17) 


CAAAAAGTAG 


(SEQ 


ID 


NO 


:18) ; 


AAAAAGTAGT 


(SEQ ID 


NO 


:19) ; 


AAAAGTAGTG 


(SEQ 


ID 


NO 


:20) ; 


AAAGTAGTGT 


(SEQ 


ID 


NO:21) 


AAGTAGTGTG 


(SEQ 


ID 


NO 


:22) ; 


AGTAGTGTGA 


(SEQ 


ID 


NO: 23) 


GTAGTGTGAT 


(SEQ 


ID 


NO 


:24) ; 


TAGTGTGATT 


(SEQ ID 


NO 


:25) ; 


AGTGTGATTG 


(SEQ 


ID 


NO 


:26) ; 


GTGTGATTGG 


(SEQ 


ID 


NO:27) 


TGTGATTGGA 


(SEQ 


ID 


NO 


:28) ; 


GTGATTGGAT 


(SEQ 


ID 


NO:29) 


TGATTGGATG 


(SEQ 


ID 


NO 


:30) ; 


GATTGGATGG 


(SEQ 


ID 


NO:31) 


ATTGGATGGC 


(SEQ 


ID 


NO 


:32) ; 


TTGGATGGCC 


(SEQ 


ID 


NO: 33) 


TGGATGGCCT 


(SEQ 


ID 


NO 


= 34) ; 


GGATGGCCTG 


(SEQ 


ID 


NO : 3 5 ) 


GATGGCCTGC 


(SEQ 


ID 


NO 


:36) ; 


ATGGCCTGCT 


(SEQ 


ID 


NO:37) 
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TGGPPTGCTG 


(SEO 


ID 


NO : 


38) ,v 


GGCCTGCTGT 


(SEQ 


ID 


NO : 3 9 ) 


GPPTGPTGTA 


(SEO 


ID 


NO : 


40) ; 


CCTGCTGTAA 


(SEQ 


ID 


NO:41) 


PTGPTGTAAG 


(SEO 


ID 


NO: 


42) ; 


TGCTGTAAGG 


(SEQ 


ID 


NO: 43) 


GPTGTAAGGG 


(SEO 


ID 


NO : 


44) ; 


CTGTAAGGGA 


(SEQ 


ID 


NO: 45) 


TGTAAGGGAA 


(SEO 


ID 


NO : 


46) ; 


GTAAGGGAAA 


(SEQ 


ID 


NO: 47) 


TAAGGGAAAG 


(SEQ 


ID 


NO : 


48) ; 


AAGGGAAAGA 


(SEQ 


ID 


NO: 49) 


AGGGAAAGAA 


(SEQ 


ID 


NO: 


50) ; 


GGGAAAGAAT 


(SEQ 


ID 


NO: 51) 


GGAAAGAATG 


(SEQ 


ID 


NO: 


52) ; 


GAAAGAATGA 


(SEQ 


ID 


NO: 53) 


AAAGAATGAG 


(SEQ 


ID 


NO: 


54) ; 


AAGAATGAGA 


(SEQ 


ID 


NO: 55) 


AGAATGAGAC 


(SEQ 


ID 


NO: 


56) ; 


GAATGAGACG 


(SEQ 


ID 


NO: 57) 


AATGAGACGA 


(SEQ 


ID 


NO: 


58) ; 


ATGAGACGAG 


(SEQ 


ID 


NO: 59) 


TGAGACGAGC 


{SEQ 


ID 


NO: 


60) ; 


GAGACGAGCT 


(SEQ 


ID 


NO:61) 


AGAPGAGCTG 


(SEO 


ID 


NO : 


62) ; 


GACGAGCTGA 


(SEQ 


ID 


NO: 63) 


APGAGPTGAG 


( ^EO 

\ *-JX^v 


ID 


NO : 


64) • 


CGAGCTGAGC 


(SEQ 


ID 


NO:65) 


r^AnpTfiAGPP 


( <3EO 


ID 


NO - 

XH \J * 


66) ; 


AGCTGAGCCA 


(SEQ 


ID 


NO : 6 7 ) 


GPTGAGPPAG 


(SEO 


ID 


NO : 


68) ; 


CTGAGCCAGC 


(SEQ 


ID 


NO:69) 


TGAGPPAGPA 


(SEO 


ID 


NO : 


70) ; 


GAGCCAGCAG 


(SEQ 


ID 


NO: 71) 






X, LJ 


NO * 


/ £. } / 


GCCAGCAGCA 


(SEQ 


ID 


NO: 73) 




( oCiVj 


JLL) 


"NTH • 


74. ^ * 


PAfiPAGPAGA 


(SEO 


ID 


NO : 75) 


AGCAGCAGAT 


(SEQ 


ID 


JnU : 


/o ; ; 




( QFO 
V ^x^v 


TD 

X> LJ 


NO : 7 7 ) 


CAG C AG ATGG 


/ o r*\ 

(SEQ 


ID 


InU : 


/ W ; ; 






TD 


NO : 7 9 ) 


GCAGATGGGO 


/ OTTO 

(SEQ 


T 


viU : 


on \ * 

du j ; 




f ^EO 


ID 

X XJ 


NO : 8 1 ) 


ALsA 1 X 




Tn 

1U 


Vi \J . 




GATGGGGTGG 

sjil X WWW X \>J\^i 


(SEQ 


ID 


NO : 8 3 ) 


ATGGGGTGGG 


(SEO 


ID 


NO : 


84 ) ; 


TGGGGTGGGA 


(SEQ 


ID 


NO: 85) 


GGGGTGGGAG 


(SEQ 


ID 


NO : 


86) ; 


GGGTGGGAGC 


(SEQ 


ID 


NO: 87) 


GGTGGGAGCA 


(SEQ 


ID 


NO: 


88) ; 


GTGGGAGCAG 


(SEQ 


ID 


NO : 8 9 ) 


TGGGAGCAGT 


(SEQ 


ID 


NO 


.90) ; 


GGGAGCAGTA 


(SEQ 


ID 


NO:91) 


HGAGPAGTAT 


(SEO 


ID 


NO 


: 92 ) ; 


GAGCAGTATC 


(SEQ 


ID 


NO:93) 


AnPAfVTATPT 


( SEO 


ID 

X i—f 


NO 


• 94 ) ; 


GCAGTATCTC 


(SEQ 


ID 


NO: 95) 


PAtTTATPTPG 


(SEO 


ID 


NO 


•96) ; 


AGTATCTCGA 


(SEQ 


ID 


NO: 97) 




(SEO 


ID 


NO 


♦98); . 


TATCTCGAGA 


(SEQ 


ID 


NO:99) 


ATPTPPAHAP 


( SEO 


ID 


NO 


: 1 0 0 ) ; 


TCTCGAGACC 


(SEQ 


ID ] 


NO:101) 


C 1 CvjACjAUU 1 


( 


JLJJ 




: -L u £ } / 


TPOAHAPPTA 


(SEO 


ID NO 


- 103) • 


CGAGAC CTAG 


(SEQ 


ID 


NO 


:104) ; 


GAGACCTAGA 


(SEQ 


ID NO 


'.105) ; 


AGACCTAGAA 


(SEQ 


ID 


NO 


:106) ; 


G AC CT AGAAA 


(SEQ 


ID NO 


:107) ; 


AC CT AG AAAA 


(SEQ 


ID 


NO 


:10 8) ; 


CCTAGAAAAA 


(SEQ 


ID NO 


:109) ; 


CTAGAAAAAC 


(SEQ 


ID 


NO 


:110) ; 


TAGAAAAACA 


(SEQ 


ID NO 


till) ; 


AGAAAAACAT 


(SEQ 


ID 


NO 


:112) ; 


GAAAAACATG 


(SEQ 


ID NO 


:113) ; 
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AAAAACATGG (SEQ ID NO: 114); 

AAACATGGAG (SEQ ID NO: 116) ; 

ACATGGAGCA (SEQ ID NO: 118); 

ATGGAGCAAT (SEQ ID NO: 120); 

GGAGCAATCA (SEQ ID NO: 122); 

AGCAATCACA (SEQ ID NO: 124); 

CAATCACAAG (SEQ ID NO: 126); 

ATCACAAGTA (SEQ ID NO: 128); 

CACAAGTAGC (SEQ ID NO: 130); 

CAAGTAGCAA (SEQ ID NO: 132); 
AGTAGCAATA (SEQ ID NO: 134); 

TAGCAATACA (SEQ ID NO : 13 6 ) ; 

GCAATACAGC (SEQ ID NO: 138); 
AATACAGCAG (SEQ ID NO: 140); 
TACAGCAGCT (SEQ ID NO:142); 
CAGCAGCTAA (SEQ ID NO: 144); 
GCAGCTAACA (SEQ ID NO:146); 
AGCTAACAAT (SEQ ID NO: 148); 
CTAACAATGC (SEQ ID NO: 150); 
AACAATGCTG (SEQ ID NO: 152); 
CAATGCTGCT (SEQ ID NO: 154); 
ATGCTGCTTG (SEQ ID NO: 156); 
GCTGCTTGTG (SEQ ID NO: 158); 
TGCTTGTGCC (SEQ ID NO:160); 
CTTGTGCCTG (SEQ ID NO: 162); 
TGTGCCTGGC (SEQ ID NO: 164); 
TGCCTGGCTA (SEQ ID NO:166); 
CCTGGCTAGA (SEQ ID NO: 168); 
TGGCTAGAAG (SEQ ID NO: 170); 
GCTAGAAGCA (SEQ ID NO: 172) ; 
TAGAAGCACA (SEQ ID NO: 174); 
GAAGCACAAG (SEQ ID NO:176); 
AGCACAAGAG (SEQ ID NO: 178); 
CACAAGAGGA (SEQ ID NO:180); 
CAAGAGGAGG (SEQ ID NO: 182); 
AGAGGAGGAA (SEQ ID NO : 1 8 4 ) ; 
AGGAGGAAGA (SEQ ID NO: 186); 
GAGGAAGAGG (SEQ ID NO: 188); 



AAAACATGGA (SEQ ID NO: 115); 
AACATGGAGC (SEQ ID NO: 117); 
CATGGAGCAA (SEQ ID NO: 119); 
TGGAGCAATC (SEQ ID NO: 121); 
GAGCAATCAC (SEQ ID NO:123); 
GCAATCACAA (SEQ ID NO:125); 

AAT CACAAGT (SEQ ID NO:127); 

T CACAAGT AG (SEQ ID NO : 12 9 ) ; 

ACAAGTAGCA (SEQ ID NO : 1 3 1 ) ; 

AAG TAG CAAT (SEQ ID NO:133); 

GTAGCAATAC (SEQ ID NO:135); 

AGCAATACAG (SEQ ID NO:137); 

CAATACAGCA (SEQ ID NO:139); 

ATACAGCAGC (SEQ ID NO:141); 

ACAGCAGCTA (SEQ ID NO:143); 

AGCAGCTAAC (SEQ ID NO:145); 

CAGCTAACAA (SEQ ID NO:147); 

GCTAACAATG (SEQ ID NO:149); 

TAACAATGCT (SEQ ID NO:151); 

ACAATGCTGC (SEQ ID NO:153); 

AATGCTGCTT (SEQ ID NO:155); 

TGCTGCTTGT (SEQ ID NO:157); 

CTGCTTGTGC (SEQ ID NO:159); 

GCTTGTGCCT (SEQ ID NO: 161) ; 

TTGTGCCTGG (SEQ ID NO:163); 
GTGCCTGGCT (SEQ ID NO:16'5); 
GCCTGGCTAG (SEQ ID NO : 167); 
CTGGCTAGAA (SEQ ID NO:169); 
GGCTAGAAGC (SEQ ID NO:171); 
CTAGAAGCAC (SEQ ID NO:173); 
AGAAGCACAA (SEQ ID NO: 175); 
AAGCACAAGA (SEQ ID NO: 177); 
GCACAAGAGG (SEQ ID NO:179); 
ACAAGAGGAG (SEQ ID NO: 181); 
AAGAGGAGGA (SEQ ID NO:183); 
GAGGAGGAAG (SEQ ID NO: 185) ; 
GGAGGAAGAG (SEQ ID NO:187); 
AGGAAGAGGT (SEQ ID NO: 189) ; 
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GGAAGAGGTG (SEQ ID NO: 19 0.) ; 
AAGAGGTGGG (SEQ ID NO: 192); 
GAGGTGGGTT (SEQ ID NO: 194); 
GGTGGGTTTT (SEQ ID NO: 196); 
TGGGTTTTCC (SEQ ID NO: 198); 
GGTTTTCCAG (SEQ ID NO: 200); 
TTTTCCAGTC (SEQ ID NO:202); 
TTCCAGTCAC (SEQ ID NO: 204); 
CCAGTCACAC (SEQ ID NO: 2 06); 
AGTCACACCT (SEQ ID NO:208); 
■TCACACCTCA (SEQ ID NO:210); 
ACACCTCAGG (SEQ ID NO: 212); 
ACCTCAGGTA (SEQ ID NO: 214); 
CTCAGGTACC (SEQ ID NO: 216); 
CAGGTACCTT (SEQ ID NO : 218 ) ; 
GGTACCTTTA (SEQ ID NO: 220); 
TACCTTTAAG ( SEQ ID NO : 2 2 2 ) ; 
C CTTT AAGAC (SEQ ID NO: 224); 
TTTAAGACCA (SEQ ID NO: 226); 
TAAGACCAAT (SEQ ID NO: 228); 
AGAC CAATG A (SEQ ID NO: 230); 
ACCAATGACT (SEQ ID NO: 2 32); 
CAATGACTTA (SEQ ID NO: 234); 
ATGACTTACA ( SEQ ID NO : 2 3 6 ) ; 
GACTTACAAG ( SEQ ID NO : 2 3 8 ) ; 
CTTACAAGGC (SEQ ID NO: 240); 
TACAAGGCAG (SEQ ID NO: 242); 
CAAGGCAGCT (SEQ ID NO:244); 
AGGCAGCTGT (SEQ ID NO:246); 
GCAGCTGTAG (SEQ ID NO: 248); 
AGCTGTAGAT (SEQ ID NO:250); 
CTGTAGATCT (SEQ ID NO: 2 52) ; 
GTAGATCTTA (SEQ ID NO: 2 54); 
AGATCTTAGC (SEQ ID NO: 256); 
ATCTTAGCCA (SEQ ID NO: 258); 
CTTAGCCACT (SEQ ID NO: 260); 
TAGCCACTTT (SEQ ID NO:262); 
GCCACTTTTT (SEQ ID NO: 264) ; 
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GAAGAGGTGG (SEQ ID NO:191); 
AGAGGTGGGT (SEQ ID NO:193); 
AGGTGGGTTT (SEQ ID NO: 195); 
GTGGGTTTTC (SEQ ID NO:197); 
GGGTTTTCCA (SEQ ID NO:199); 
GTTTTCCAGT (SEQ ID NO:201); 
TTTCCAGTCA (SEQ ID NO:203); 
TCCAGTCACA (SEQ ID NO:205); 
CAGTCACACC (SEQ ID NO:207); 
GTCACACCTC (SEQ ID NO:209); 
CACACCTCAG (SEQ ID NO:211); 
CACCTCAGGT (SEQ ID NO:213); 
CCTCAGGTAC (SEQ ID NO:215); 
TCAGGTACCT (SEQ ID NO:217); 
AGGTACCTTT (SEQ ID NO:219); 
GTACCTTTAA (SEQ ID NO:221); 
ACCTTTAAGA (SEQ ID NO:223); 
CTTTAAGACC (SEQ ID NO:225); 
TTAAGACCAA (SEQ ID NO:227); 
AAGACCAATG (SEQ ID NO:229); 
GACCAATGAC (SEQ ID NO:231); 
CCAATGACTT (SEQ ID NO:233); 
AATGACTTAC ( SEQ ID NO : 2 3 5 ) ; 
TGACTTACAA (SEQ ID NO:237); 
ACTTACAAGG (SEQ ID NO:239); 
TTACAAGGCA (SEQ ID NO: 241); 
ACAAGGCAGC (SEQ ID NO: 243); 
AAGGCAGCTG ( SEQ ID NO : 2 4 5 ) ; 
GGCAGCTGTA (SEQ ID NO:247); 
CAGCTGTAGA (SEQ ID NO: 24 9) ; 
GCTGTAG AT C (SEQ ID NO: 251) ; 
TGTAGATCTT (SEQ ID NO: 2 53) ; 
TAGATCTTAG (SEQ ID NO: 2 55) ; 
GATCTTAGCC (SEQ ID NO: 257) ; 
TCTTAGCCAC (SEQ ID NO: 259) ; 
TTAGCCACTT (SEQ ID NO:261); 
AGCCACTTTT ( SEQ ID NO : 2 6 3 ) ; 
CCACTTTTTA (SEQ ID NO: 265); 



WO 95/21912 



PCT/AU95/00063 



CACTTTTTAA (SEQ ID NO: 266); 
CTTTTTAAAA (SEQ ID NO: 268); 
TTTTAAAAGA (SEQ ID NO:270); 

TTAAAAGAAA (SEQ ID NO: 272); 

AAAAGAAAAG (SEQ ID NO: 2 74); 

AAGAAAAGGG (SEQ ID NO: 276) ; 

GAAAAGGGGG (SEQ ID NO: 278); 

AAAGGGGGGA (SEQ ID NO: 280); 

AGGGGGGACT (SEQ ID NO: 282); 

GGGGGACTGG (SEQ ID NO: 2 84); 

GGGACTGGAA (SEQ ID NO: 286); 

GACTGGAAGG (SEQ ID NO: 288); 

CTGGAAGGGC ( SEQ ID NO : 2 9 0 ) ; 

GGAAGGGCTA (SEQ ID NO: 2 92); 

AAGGGCTAAT (SEQ ID NO: 2 94); 

GGGCTAATTC ( SEQ ID NO : 2 9 6 ) ; 

GCTAATTCAC (SEQ ID NO: 2 98); 

TAATTCACTC ( SEQ ID NO : 3 0 0 ) ; 
ATTCACTCCC (SEQ ID NO:302); 
TCACTCC.CAA (SEQ ID NO: 3 04); 
ACTCCCAAAG (SEQ ID NO: 3 06); 
TCCCAAAGAA (SEQ ID NO: 3 08); 
CCAAAGAAGA ( SEQ ID NO : 3 1 0 ) ; 
AAAGAAGACA (SEQ ID NO: 3 12); 
AGAAGACAAG (SEQ ID NO: 3 14); 
AAGACAAGAT (SEQ ID NO: 3 16); 
GACAAGATAT (SEQ ID NO: 3 18); 
CAAGATATCC (SEQ ID NO: 320); 
AGATATCCTT (SEQ ID NO: 3 22) ; 
ATATCGTTGA (SEQ ID NO: 3 24) ; 
ATCCTTGATC (SEQ ID NO: 326) ; 
CCTTGATCTG (SEQ ID NO: 3 28); 
TTGATCTGTG (SEQ ID NO: 330); 
GAT CTGTGGA (SEQ ID NO: 332) ; 
TCTGTGGATC (SEQ ID NO: 334); 
TGTGGATCTA (SEQ ID NO:336); 
TGGATCTACC (SEQ ID NO:338); 
GATCTACCAC (SEQ ID NO:340); 



232 - 

ACTTTTTAAA (SEQ ID NO: 267) 
TTTTTAAAAG (SEQ ID NO: 269) 
TTTAAAAGAA (SEQ ID NO: 271) 
TAAAAGAAAA (SEQ ID NO: 2 73) 
AAAGAAAAGG (SEQ ID NO: 2 75) 
AGAAAAGGGG (SEQ ID NO: 2 77) 
AAAAGGGGGG (SEQ ID NO: 279) 
AAGGGGGGAC (SEQ ID NO: 2 81) 
GGGGGGACTG (SEQ ID NO: 283) 
GGGGACTGGA (SEQ ID NO: 2 85) 
GGACTGGAAG (SEQ ID NO: 287) 
ACTGGAAGGG (SEQ ID NO: 289) 
TGGAAGGGCT (SEQ ID NO: 2 91), 
GAAGGGCTAA (SEQ ID NO: 2 93), 
AGGGCTAATT (SEQ ID NO: 2 95); 
GGCTAATTCA (SEQ ID NO: 2 97); 
CTAATT CACT ( SEQ ID NO : 2 9 9 ) ; 

AATTCACTCC (SEQ ID NO.-301); 

TTCACTCCCA (SEQ ID NO:303); 

CACTCCCAAA (SEQ ID NO:305); 

CTCCCAAAGA (SEQ ID NO:307); 

CCCAAAGAAG (SEQ ID NO:309); 

CAAAGAAGAC (SEQ ID NO:311); 

AAGAAGACAA (SEQ ID NO:313); 

GAAGACAAGA (SEQ ID NO:315); 

AGACAAGATA ( SEQ ID NO : 3 1 7 ) ; 

ACAAGATATC ( SEQ ID NO : 3 1 9 ) ; 

AAGATATCCT (SEQ ID NO : 3 2 1 ) ; 

GATATCCTTG (SEQ ID NO : 323); 

TATCCTTGAT (SEQ ID NO:325); 

TCCTTGATCT (SEQ ID NO: 327) ; 

CTTGATCTGT (SEQ ID NO:329); 

TGAT CTGTGG (SEQ ID NO:331); 

ATCTGTGGAT (SEQ ID NO:333); 

CTGTGGATCT (SEQ ID NO:33 5); 

GTGGATCTAC (SEQ ID NO:337); 

GGATCTACCA (SEQ ID NO : 3 3 9 ) ; 
ATCTACCACA (SEQ ID NO:341); 
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TCTACCACAC 


(SEQ 


ID 


NO: 


342) ; 


CTACCACACA 


(SEQ 


ID 


NO: 


343) ; 


TACCACACAC 


(SEQ 


ID 


NO: 


344) ; 


ACCACACACA 


(SEQ 


ID 


NO: 


345) ; 


CCACACACAA 


(SEQ 


ID 


NO: 


346) ; 


CACACACAAG 


(SEQ 


ID 


NO: 


347) ; 


ACACACAAGG 


(SEQ 


ID 


NO : 


34 8) ; 


CACACAAGGC 


(SEQ 


ID 


NO: 


34 9) ;. 


ACACAAGGCT 


(SEQ 


ID 


NO : 


350) ; 


CACAAGGCTA 


(SEQ 


ID 


NO: 


351) ; 


ACAAGGCTAC 


(SEQ 


ID 


NO: 


352) ; 


CAAGGCTACT 


(SEQ 


ID 


NO: 


353) ; 


AAGGCTACTT 


(SEQ 


ID 


NO: 


354) ; 


AGGCTACTTC 


(SEQ 


ID 


NO: 


355) ; 


GGCTAGTTCC 


(SEQ 


ID 


NO: 


356) ; 


GCTACTTCCC 


(SEQ 


ID 


NO: 


357) ; 


CTACTTCCCT 


(SEQ 


ID 


NO: 


358) ; 


TACTTCCCTG 


(SEQ 


ID 


NO: 


359) ; 


ACTTCCCTGA 


(SEQ 


ID 


NO: 


360) ; 


CTTCCCTGAT 


(SEQ 


ID 


NO: 


361) ; 


TTCCCTGATT 


(SEQ 


ID 


NO: 


362) ; 


TCCCTGATTG 


(SEQ 


ID 


NO: 


363) ; 


CCCTGATTGG 


(SEQ 


ID 


NO: 


364) ; 


CCTGATTGGC 


(SEQ 


ID 


NO: 


365) ; 


CTGATTGGCA 


(SEQ 


ID 


NO: 


366) ; 


TGATTGGCAG 


(SEQ 


ID 


NO: 


367) ; 


GATTGGCAGA 


(SEO 


ID 


NO : 


36 8) ; 


ATTGGCAGAA 


(SEQ 


ID 


NO: 


369) ; 


TTGGCAGAAC 


(SEQ 


ID 


NO: 


370) ; 


TGGCAGAACT 


(SEQ 


ID 


NO: 


371) ; 


GGCAGAACTA 


(SEQ 


ID 


NO: 


372) ; 


GCAGAACTAC 


(SEQ 


ID 


NO: 


373) ; 


CAGAACTACA 


(SEQ 


ID 


NO: 


3 74) ; 


AGAACTACAC 


(SEQ 


ID 


NO: 


375) ; 


GAACTACACA 


(SEQ 


ID 


NO: 


376) ; 


AACTACACAC 


(SEQ 


ID 


NO: 


377) ; 






IT) 

_L i-J 


NO * 


378) • 


CTACACACCA 


(SEQ 


ID 


NO: 


379) ; 






Tfi 


NTH 


j O U / / 


ACACACCAGG 


(SEO 


ID 


NO: 


381) ; 


CACAC CAGGrG 


I SEQ 


T 

1JJ 




- 

JOZI / 




(SEO 


ID 


NO : 


3 83) ; 




(bbQ 


T "Pi 




J Ot / / 




(SEO 


ID 


NO : 


3 85) ; 






TD 


NO 


3 8 6) ; 


CAGGGCCAGG 


(SEQ 


ID 


NO: 


3 87) ; 


AGGGCCAGGG 


(SEQ 


ID 


NO 


: 3 8 8 ) ; 


GGGCCAGGGG 


(SEQ 


ID 


NO: 


389) ; 


GGCCAGGGGT 


(SEQ 


ID 


NO 


:390) ; 


GCCAGGGGTC 


(SEQ 


ID 


NO: 


391) ; 


CCAGGGGTCA 


(SEQ 


ID 


NO 


:392) ; 


CAGGGGTCAG 


(SEQ 


ID 


NO 


:393) ; 


AGGGGTCAGA 


(SEQ 


ID 


NO 


:394) ; 


GGGGTCAGAT 


(SEQ 


ID 


NO 


:395) ; 


GGGTCAGATA 


(SEQ 


ID 


NO 


:396) ; 


GGTCAGATAT 


(SEQ 


ID 


NO 


:397) ; 


GTCAGATATC 


(SEQ 


ID 


NO 


: 398) ; 


TCAGATATCC 


(SEQ 


ID 


NO 


:399) ; 


CAGATATCCA 


(SEQ 


ID 


NO 


:400) ; 


AGATATCCAC 


(SEQ 


ID 


NO 


:401) ; 


GATATCCACT 


(SEQ 


ID 


NO 


:402) ; 


ATATCCACTG 


(SEQ 


ID 


NO 


:403) ; 


TATCCACTGA 


(SEQ 


ID 


NO 


:404) ; 


ATCCACTGAC 


(SEQ 


ID 


•NO 


:405) ; 






TD 


NO 


: 4 0 6 ) ; 


CCACTGACCT 


(SEQ 


ID 


NO 


:407) ; 


CACTGACCTT 


(SEQ 


ID 


NO 


:408) ; 


ACTGAC CTTT 


(SEQ 


ID 


NO 


:409) ; 


CTGACCTTTG 


(SEQ 


ID 


NO 


:410) ; 


TGACCTTTGG 


(SEQ 


ID 


NO 


:411) ; 


GACCTTTGGA 


(SEQ 


ID 


NO 


:412) ; 


ACCTTTGGAT 


(SEQ 


ID 


NO 


:413) ; 


CCTTTGGATG 


(SEQ 


ID 


NO 


:414) ; 


CTTTGGATGG 


(SEQ 


ID 


NO 


:415) ; 


TTTGGATGGT 


(SEQ 


ID 


NO 


:416) ; 


TTGGATGGTG 


(SEQ 


ID 


NO 


:417) ; 
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TGGATGGTGC (SEQ ID NO: 418) 
GATGGTGCTA (SEQ ID NO: 42 0) 
TGGTGCTACA (SEQ ID NO: 422) 
GTGCTACAAG (SEQ ID NO: 424) 
GCTACAAGCT (SEQ ID NO: 42 6) 
TACAAGCTAG (SEQ ID' NO: 428) 
CAAG CTAGT A (SEQ ID NO: 43 0) 
AGCTAGTACC (SEQ ID NO: 432) 
CTAGTACCAG (SEQ ID NO: 434) 
AGTACCAGTT (SEQ ID NO: 436) 
TACCAGTTGA (SEQ ID NO: 43 8) 
CCAGTTGAGC (SEQ ID NO: 44 0) 
AGTTGAGCCA (SEQ ID NO: 442) 
TTGAGC CAG A (SEQ ID NO: 444) 
GAGCCAGATA (SEQ ID NO: 44 6) 
GCCAGATAAG (SEQ ID NO: 44 8) 
CAGATAAGGT (SEQ ID NO: 4 50) 
GATAAGGTAG (SEQ ID NO: 452) 
TAAGGTAGAA (SEQ ID NO: 454) 
AGGTAGAAGA (SEQ ID NO: 456) 
GTAGAAGAGG (SEQ ID NO: 4 58) 
AGAAGAGGCC (SEQ ID NO: 460) 
AAGAGGCCAA (SEQ ID NO: 462) 
GAGGCCAATA (SEQ ID NO: 4 64) 
GGCCAATAAA (SEQ ID NO: 466) 
CCAATAAAGG (SEQ ID NO:468), 

AATAAAGGAG ( SEQ ID NO : 4 7 0 ) , 

TAAAGGAGAG (SEQ ID NO: 4 72), 

AAGGAGAGAA (SEQ ID NO: 4 74) ; 

GGAGAGAACA (SEQ ID NO: 4 76); 

AGAGAACACC (SEQ ID NO:478); 

AGAACACCAG (SEQ ID NO: 480); 

AACACCAGCT (SEQ ID NO: 4 82) ; 

CACCAGCTTG (SEQ ID NO: 484); 

CCAGCTTGTT (SEQ ID NO:486); 

AGCTTGTTAC (SEQ ID NO: 4 88); 

CTTGTTACAC (SEQ ID NO: 4 90); 

TGTTACACCC (SEQ ID NO: 4 92); 
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GGATGGTGCT (SEQ ID NO: 419) 
ATGGTGCTAC (SEQ ID NO: 421) 
GGTGCTACAA (SEQ ID NO: 42 3) 
TGCTACAAGC (SEQ ID NO: 425) 
CTACAAGCTA (SEQ ID NO: 42 7) 
ACAAG CTAGT (SEQ ID NO: 42 9) 
AAGCTAGTAC (SEQ ID NO: 431) 
GCTAGTACCA (SEQ ID NO: 43 3) 
TAGTACCAGT (SEQ ID NO: 43 5) 
GTACCAGTTG (SEQ ID NO: 43 7) 
ACCAGTTGAG (SEQ ID NO: 43 9) 
CAGTTGAGCC (SEQ ID NO: 441) 
GTTGAGCCAG (SEQ ID NO: 443) 
TG AG C CAG AT (SEQ ID NO: 44 5) 
AGCCAGATAA (SEQ ID NO: 447) 
CCAGATAAGG (SEQ ID NO: 44 9) 
AGATAAGGTA (SEQ ID NO: 451) 
ATAAGGTAGA (SEQ ID NO: 4 53) 
AAGGTAGAAG (SEQ ID NO: 455) 
GGTAGAAGAG (SEQ ID NO: 457) 
TAGAAGAGGC (SEQ ID NO: 459) 
GAAGAGGCCA (SEQ ID NO: 4 61) 
AGAGGCCAAT (SEQ ID NO: 463) 
AGGCCAATAA (SEQ ID NO: 465) 
GCCAATAAAG (SEQ ID NO: 467) 
CAATAAAGGA (SEQ ID NO: 469) 
ATAAAGGAGA (SEQ ID NO: 471) 
AAAGGAGAGA (SEQ ID NO: 473) 
AGGAGAGAAC (SEQ ID NO: 475), 
GAGAGAACAC (SEQ ID NO: 4 77), 
GAGAACACCA (SEQ ID NO:479), 
GAACACCAGC (SEQ ID NO: 4 81) . 
ACACCAGCTT (SEQ ID NO:483), 
ACCAGCTTGT (SEQ ID NO: 4 85), 
CAGCTTGTTA (SEQ ID NO: 4 87), 
GCTTGTTACA (SEQ ID NO: 4 89) . 
TTGTTACACC (SEQ ID NO: 4 91), 
GTTACACCCT (SEQ ID NO: 4 93) ; 
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TTACACCCTG 


(SEQ 


ID 


NO , 


494) ■ 


TACACCCTGT 


( SEQ 


ID 


NO ■ 


4 95) 


ACACCCTGTG 


(SEO 


ID 


NO : 


4 96) ; 




( ^FO 


X LJ 


NO • 

X>i w . 


*± Z7 / J 


ACCCTGTGAG 


(SEO 


ID 


NO : 


4 98) ; 


CCCTGTGAGC 


(SEQ 


ID 


NO : 


4 99) 

■x J J / 


CCTGTGAGCC 


(SEQ 


ID 


NO: 


500) ; 


CTGTGAGCCT 


(SEQ 


ID 


NO : 


501) 


TGTGAGCCTG 


(SEQ 


ID 


NO : 


502) ; 


GTGAGCCTGC 


(SEQ 


ID 


NO : 


503 ) 


TGAGCCTGCA 


(SEQ 


ID 


NO: 


504) ; 


GAGCCTGCAT 


(SEQ 


ID 


NO : 


505) 


AGCCTGCATG 


(SEQ 


ID 


NO: 


506) ; 


GCCTGCATGG 


(SEQ 


ID 


NO: 


507) 


CCTGCATGGA 


(SEQ 


ID 


NO: 


508) ; 


CTGCATGGAA 


(SEQ 


ID 


NO: 


509) 


TGCATGGAAT 


(SEQ 


ID 


NO: 


510) ; 


GCATGGAATG 


(SEQ 


ID 


NO: 


511) 


CATGGAATGG 


(SEQ 


ID 


NO: 


512) ; 


ATGGAATGGA 


(SEQ 


ID 


NO: 


513) 


TGGAATGGAT 


(SEQ 


ID 


NO: 


514) ; 


GGAATGGATG 


(SEQ 


ID 


NO: 


515) 


GAATGGATGA 


(SEQ 


ID 


NO : 


516) ; 


AATGGATGAC 

***X X Vw* XJ^i X 1 wii\»* 


(SEQ 


ID 


NO : 


517 ) 


ATGGATGACC 


(SEQ 


TD 

X LJ 


NO • 


SIR)* 




V w 


TD 

X i-J 


NO - 


z> x y } 


GGATGACOOT 


( ^PD 

\ Oljy 


XLJ 


NO • 




Vjxt. X V— V- X \j 




TTi 
XL) 


ViKJ . 




ATGACCCTGA 


v 


TD 
X u 


Vi\j . 


3 ^ z y , 


HHC >r TCZ1±CZ 
X VjiA^-\— V_- X \JJr\\J 




T*n 

XL) 




c; o "3 ^ 


GACCCTGAGA 




TD 

X LJ 


"MO • 

Vi \J • 


roa \ * 
«j ^ *± / / 




/ qpd 


TP) 

xu 


NO - 


Zj ZJ f 


CCCTGAGAGA 


(SEQ 


ID 


NO • 


5?6 ) • 






TD 

X LJ 


NO - 


Z? I J 


CTGAGAGAGA 


(SEO 

v ° v: 


ID 


NO : 


52 8 ) * 




f SEO 


TD 

X LJ 


NO • 


~J ZJ f 


GAGAGAGAAG 


(SEQ 


ID 


NO : 


530)* 


AGAGAGAAGT 


fSEO 

\ *J X-J 


TD 

X LJ 


NO • 

Xi w • 


531) 

_> J X J 


GAGAGAAGTG 


(SEQ 


ID 


NO : 


532) ; 


AGAGAAGTGT 


( SEO 


TD 

X LJ 


NO - 

L* \J • 


j o o ; 


GAGAAGTGTT 


(SEQ 


ID 


NO : 


5"*4 ) * 


AGA AGTGTTA 


f SEO 


TD 

X LJ 


NO • 


53 5) 


GAAGTGTTAG 


( SEQ 


ID 


NO : 


536 ) • 


AAGTGTTAGA 


(SEO 


TD 

X LJ 


NO * 


53 7) 


AGTGTTAGAG 


(SEO 


ID 


NO : 


53 8) ; 


GTGTTAGAGT 

"J X VJ X X tt\J£WJ X 


( SEO 

\ >*?x>v^ 


TD 

X XJ 


NO * 


5 3 Q ) 


TGTTAGAOTO 


V oc«y 


X LJ 


no - 


Zj*±\J J f 


Oil f\\Jr\\J X Ovjt 


^ cpn 


TD 
XL) 


"MO * 


O^t X J 






XL) 


JNU : 


a a o \ - 
Dftz ; ; 




v ot*y 


ID 


WU : 


543 ) 






XL) 


TSTO . 

jnu : 


o44 ; ; 


\jPiSj L VjQjAVjvj 1 




1JJ 


NO : 


r >i r- \ 






x JJ 


INC - 


o / ; 


t»7 1 o^ALj^j* 111 




TT"\ 
XL) 


InO : 


54 / ; 






TP) 
XXJ 


MO • 
IN . 


0*xO ) , 


OljAvjVjl 1 XVjA 




1JJ 


JNU : 




GAGGTTTGAC 


( SEO 
\ t*-' i-j 


TD 


NO • 

XX \J * 


5 5 o ) • 
~j ~j \j j t 


AHnTTTHAPA 


/ qpn 

\ lD X-i^/ 


TD 
x U 


NO • 


3 O X / 


GGTTTGACAG 


(SEQ 


ID 


NO : 


552) ; 


GTTTGACAGC 


(SEQ 


ID 

X LJ 


NO • 

XX w • 


5 53^ 

-J J -O / 


TTTGACAGCC 


(SEQ 


ID 


NO : 


554) ; 


TTGACAGCCG 


(SEQ 


ID 


NO: 


555 ) 

-j -j -j / 


TGACAGCCGC 


(SEQ 


ID 


NO : 


556) ; 


GACAGCCGCC 


(SEQ 


ID 


NO : 


557 ) 






TD 

XL) 


MPi • 
JNU . 




t,iivj L- V- U U 1 A 




XL) 


JNU : 


c c o \ 


AGCCGCCTAG 


(SEQ 


ID 


NO: 


560) ; 


GCCGCCTAGC 


(SEQ 


ID 


NO: 


561) 


CCGCCTAGCA 


(SEQ 


ID 


NO: 


562) ; 


CGCCTAGCAT 


(SEQ 


ID 


NO: 


563) 


GCCTAGCATT 


(SEQ 


ID 


NO: 


564) ; 


CCTAGCATTT 


(SEQ 


ID 


NO: 


565) 


CTAGCATTTC 


(SEQ 


ID 


NO: 


566) ; 


TAGCATTTCA 


(SEQ 


ID 


NO: 


567) 


AGCATTTCAT 


(SEQ 


ID 


NO: 


568) ; 


GCATTTCATC 


(SEQ 


ID 


NO: 


569) 
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CATTT CAT C A 


(SEQ 


ID 


NO 


: 570 ) ; 


ATTTCATCAC 


(SEQ 


ID 


NO : 


5 71) ; 


TTTCATCACG 


(SEQ 


ID 


NO 


: 572 ) ; 


TTCATCACGT 


(SEQ 


ID 


NU : 


b / 6 ) ; 


TCATCACGTG 


(SEQ 


ID 


NO 


: 5 74 ) ; 


CAT C ACGTGG 


(SEQ 


ID 


NU : 


d / d ) ; 


ATCACGTGGC 


(SEQ 


ID 


NO 


: 576) ; 


TCACGTGGCC 


(SEQ 


ID 


NO : 


577) ; 


CACG7GGCCC 


(SEQ 


ID 


NO 


: 578) ; 


ACGTGGCCCG 


(SEQ 


ID 


NO : 


XT *~l \ 

57 9) ; 


CGTGGCCCGA 


(SEQ 


ID 


NO 


:580) ; 


GTGGCCCGAG 


(SEQ 


ID 


NO : 


581) ; 


TGGCCCGAGA 


(SEQ 


ID 


NO 


:582) ; 


GGCCCG AG AG 


(SEQ 


ID 


NO : 


583 ) ; 


GCCCGAGAGC 


(SEQ 


ID 


NO 


: 5 8 4 ) ; 


CCCGAGAGCT 


(SEQ 


ID 


NO : 


5 85) ; 


CCGAGAGCTG 


(SEQ 


ID 


NO 


: 5 8 6 ) ; 


CGAGAGCTGC 


(SEQ 


ID 


NO : 


C O *~7 \ 

5 87) ; 


GAGAGCTGCA 


(SEQ 


ID 


NO 


: 5 8 8 ) ; 


AGAGCTGCAT 


(SEQ 


ID 


NO : 


5 8 9) ; 


GAGCTGCATC 


(SEQ 


ID 


NO 


:590) ; 


AGCTGCATCC 


(SEQ 


ID 


NO : 


591) ; 


GCTGCATCCG 


(SEQ 


ID 


NO 


:592) ; 


CTGCATCCGG 


(SEQ 


ID 


NO : 


593 ) ; 


TGCATCCGGA 


(SEQ 


ID 


NO 


:594) ; 


GCATCCGGAG 


(SEQ 


ID 


NO : 


595) ; 


CATCCGGAGT 


(SEQ 


ID 


NO 


:596) ; 


ATC CGGAGTA 


(SEQ 


ID 


NO : 


597) ; 


TCCGGAGTAC 


(SEQ 


ID 


NO 


: 598) ; 


CCGGAGTACT 


(SEQ 


ID 


NO : 


599) ; 


CGGAGTACTT 


(SEQ 


ID 


NO 


: 600) ; 


GGAGTACTTC 


(SEQ 


ID 


NO : 


601) ; 


GAGTACTTCA 


(SEQ 


ID 


NO 


: 602) ; 


AGTACTTCAA 


(SEQ 


ID 


NO : 


603) ; 


GTACTTCAAG 


(SEQ 


ID 


NO 


:604) ; 


TACTTCAAGA 


(SEQ 


ID 


NO : 


605) ; 


ACTTCAAGAA 


(SEQ 


ID 


NO 


:606) ; 


CTTCAAGAAC 


(SEQ 


ID 


NO: 


607) ; 


TTCAAGAACT 


(SEQ 


ID 


NO 


:608) ; 


TCAAGAACTG 


(SEQ 


ID 


NO: 


609) ; 


CAAGAACTGC 


(SEQ 


ID 


NO 


:610) ; 


AAGAACTGCT 


(SEQ 


ID 


NO: 


611) ; 


AGAACTGCTG 


(SEQ 


ID 


NO 


: 612) ; 


GAACTGCTGA 


(SEQ 


ID 


NO: 


613) . 



85. A strain of HIV-1 according to claim 79 wherein said deletion encompasses at 
least one of the following decanucleotides from the LTR region of HIV-1 strain NL4-3: 



GCTTTTTGCC 


(SEQ 


ID 


NO: 


652) ; 


CTTTTTGCCT 


(SEQ 


ID 


NO: 


653) 


TTTTTGCCTG 


(SEQ 


ID 


NO: 


654) ; 


TTTTGCCTGT 


(SEQ 


ID 


NO: 


655) 


TTTGCCTGTA 


(SEQ 


ID 


NO: 


656) ; 


TTGCCTGTAC 


(SEQ 


ID 


NO: 


657) 


TGCCTGTACT 


(SEQ 


ID 


NO: 


658) ; 


GCCTGTACTG 


(SEQ 


ID 


NO: 


659) 


CCTGTACTGG 


(SEQ 


ID 


NO: 


660) ; 


CTGTACTGGG 


(SEQ 


ID 


NO: 


661) 


TGTACTGGGT 


(SEQ 


ID 


NO: 


662) ; 


GTACTGGGTC 


(SEQ 


ID 


NO: 


663) 


TACTGGGTCT 


(SEQ 


ID 


NO: 


664) ; 


ACTGGGTCTC 


(SEQ 


ID 


NO: 


665) 


CTGGGTCTCT 


(SEQ 


ID 


NO: 


666) ; 


TGGGTCTCTC 


(SEQ 


ID 


NO: 


667) 


GGGTCTCTCT 


(SEQ 


ID 


NO: 


668) ; 


GGTCTCTCTG 


(SEQ 


ID 


NO: 


669) 


GTCTCTCTGG 


(SEQ 


ID 


NO: 


670) ; 


TCTCTCTGGT 


(SEQ 


ID 


NO: 


671) 


CTCTCTGGTT 


(SEQ 


ID 


NO: 


672) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


673) 


CTCTGGTTAG 


(SEQ 


ID 


NO: 


674) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


675) 
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CTGGTTAGAC 


(SEQ 


ID 


NO: 


676) ; 


TGGTTAGACC 


(SEQ 


ID 


NO: 


677) 


GGTTAGACCA 


(SEQ 


ID 


NO: 


678) ; 


GTTAGACCAG 


(SEQ 


ID 


NO: 


679) 


TTAGACCAGA 


(SEQ 


ID 


NO: 


680) ; 


TAGACCAGAT 


(SEQ 


ID 


NO: 


681) 


AGACCAGATC 


(SEQ 


ID 


NO: 


682) ; 


GACCAGATCT 


(SEQ 


ID 


NO: 


683) 


ACCAGATCTG 


(SEQ 


ID 


NO: 


684) ; 


CCAGATCTGA 


(SEQ 


ID 


NO: 


685) 


CAGATCTGAG 


(SEQ 


ID 


NO: 


686) ; 


AGATCTGAGC 


(SEQ 


ID 


NO: 


687) 


GATCTGAGCC 


(SEQ 


ID 


NO: 


688) ; 


ATCTGAGCCT 


(SEQ 


ID 


NO: 


689) 


TCTGAGCCTG 


(SEQ 


ID 


NO: 


690) ; 


CTGAGCCTGG 


(SEQ 


ID 


NO: 


691) 


TGAGCCTGGG 


(SEQ 


ID 


NO: 


6 92) ; 


GAGCCTGGGA 


(SEQ 


ID 


NO: 


693) 


AGCCTGGGAG 


(SEQ 


ID 


NO: 


6 94) ; 


GCCTGGGAGC 


(SEQ 


ID 


NO: 


695) 


CCTGGGAGCT 


(SEQ 


ID 


NO: 


696} ; 


CTGGGAGCTC 


(SEQ 


ID 


NO: 


697) 


TGGGAGCTCT 


(SEQ 


ID 


NO: 


698) ; 


GGGAGCTCTC 


(SEQ 


ID 


NO: 


699) 


GGAGCTCTCT 


(SEQ 


ID 


NO: 


700) ; 


GAGCTCTCTG 


(SEQ 


ID 


NO: 


701) 


AGCTCTCTGG 


(SEQ 


ID 


NO: 


702) ; 


GCTCTCTGGC 


(SEQ 


ID 


NO: 


703) 


CTCTCTGGCT 


(SEQ 


ID 


NO: 


704) ; 


TCTCTGGCTA 


(SEQ 


ID 


NO: 


705) 


CTCTGGCTAA 


(SEQ 


ID 


NO: 


706) ; 


TCTGGCTAAC 


(SEQ 


ID 


NO: 


707) 


CTGGCTAACT 


(SEQ 


ID 


NO: 


708) ; 


TGGCTAACTA 


(SEQ 


ID 


NO: 


709) 


GGCTAACTAG 


(SEQ 


ID 


NO : 


710) ; 


GCTAACTAGG 


(SEQ 


ID 


NO: 


711) 


CT AA CTAGGG 


(SEO 


ID 


NO : 


712) ; 


TAACTAGGGA 


(SEQ 


ID 


NO: 


713) 




( c;T?n 

V oILV 


jn 

XLJ 


MO * 




ACTAGGGAAC 


(SEO 

\ ^ J — t W 


ID 


NO : 


715) 






Tn 
XJJ 


MO • 

iNvj : 


/id / / 




( 

V ODy 


ID 


NO : 


717 ) 


AGGGAACCCA 


V SEQ 


ID 


INJU : 


/ ±0 ) ; 




( QPf) 


TD 


NO • 


71 9 ) 


GGAALCCAL 1 




± U 


MO ♦ 


f £.VJ ) f 


A A POO A pto 




ID 

X, LJ 


NO : 


721 ) 






X u 


MO • 


7? 0 ) • 


ACCCACTGCT 


(SEO 


ID 


NO : 


723) 


CCCACTGCTT 


(SEQ 


ID 


NO: 


724) ; 


CGACTGCTTA 


(SEQ 


ID 


NO: 


725) 


CACTGCTTAA 


(SEQ 


ID 


NO: 


726) ; 


ACTGCTTAAG 


(SEQ 


ID 


NO: 


727) 


CTGCTTAAGC 


(SEQ 


ID 


NO: 


728) ; 


TGCTTAAGCC 


(SEQ 


ID 


NO: 


729) 


GCTTAAGCCT 


(SEQ 


ID 


NO: 


730) ; 


CTTAAGCCTC 


(SEQ 


ID 


NO: 


731) 


TTAAGCCTCA 


(SEQ 


ID 


NO: 


732) ; 


TAAGCCTCAA 


(SEQ 


ID 


NO: 


733) 


AAGCCTCAAT 


(SEQ 


ID 


NO: 


734) ; 


AGCCTCAATA 


(SEQ 


ID 


NO: 


735) 


GCCTCAATAA 


(SEO 


ID 


NO: 


73 6) ; 


CCTCAATAAA 


(SEQ 


ID 


NO 


.737) 


CT C AAT AAAG 


(SEQ 


ID 


NO: 


73 8) ; 


TCAATAAAGC 


(SEQ 


ID 


NO 


:739) 






JL i-J 


mo « 




AAT AAAG C TT 


(SEO 


ID 


NO 


: 741) 


ATAAAGCTTG 


(SEQ 


ID 


NO; 


742) ; 


TAAAGCTTGC 


(SEQ 


ID 


NO 


:743) 


. AAAGCTTGCC 


(SEQ 


ID 


NO 


.744) ; 


AAGCTTGCCT 


(SEQ 


ID 


NO 


:745) 


AGCTTGCCTT 


(SEQ 


ID 


NO 


:746) ; 


GCTTGCCTTG 


(SEQ 


. ID 


NO 


:747) 


CTTGCCTTGA 


(SEQ 


ID 


NO 


:748) ; 


TTGCCTTGAG 


(SEQ 


ID 


NO 


:749) 


TGCCTTGAGT 


(SEQ 


ID 


NO 


:750) ; 


GCCTTGAGTG 


(SEQ 


ID 


NO 


:751) 
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CCTTGAGTGC 


(SEQ 


ID 


NO 


752) ; 


CTTGAGTGCT 


(SEQ 


ID 


NO 


:753) 


TTGAGTGCTT 


(SEQ 


ID 


NO: 


754} ; 


TGAGTGCTTC 


(SEQ 


ID 


NO 


755) 


GAGTGCTTCA 


(SEQ 


ID 


NO: 


756) ; 


AGTGCTTCAA 


(SEQ 


ID 


NO 


757) , 


GTGCTTCAAG 


(SEQ 


ID 


NO: 


758) ; 


TGCTTCAAGT 


(SEQ 


ID 


NO: 


759) , 


GCTTCAAGTA 


(SEQ 


ID 


NO: 


76 0) ; 


CTTCAAGTAG 


(SEQ 


ID 


NO: 


761) , 


TTCAAGTAGT 


(SEQ 


ID 


NO: 


762) ; 


TCAAGTAGTG 


(SEQ 


ID 


NO: 


763) , 


CAAGTAGTGT 


(SEQ 


ID 


NO: 


764) ; 


AAGTAGTGTG 


(SEQ 


ID 


NO: 


765) , 


AGTAGTGTGT 


(SEQ 


ID 


NO: 


766) ; 


GTAGTGTGTG 


(SEQ 


ID 


NO: 


767) , 


TAGTGTGTGC 


(SEQ 


ID 


NO: 


768) ; 


AGTGTGTGCC 


(SEQ 


ID 


NO: 


769) ; 


GTGTGTGCCC 


(SEQ 


ID 


NO: 


770) ; 


TGTGTGCCCG 


(SEQ 


ID 


NO: 


771) ; 


GTGTGCCCGT 


(SEQ 


ID 


NO: 


772) ; 


TGTGCCCGTC 


(SEQ 


ID 


NO: 


773) ; 


GTGCCCGTCT 


(SEQ 


ID 


NO: 


774) ; 


TGCCCGTCTG 


(SEQ 


ID 


NO: 


775) ; 


GCCCGTCTGT 


(SEQ 


ID 


NO: 


776) ; 


CCCGTCTGTT 


(SEQ 


ID 


NO: 


777) ; 


CCGTCTGTTG 


(SEQ 


ID 


NO: 


778) ; 


CGTCTGTTGT 


(SEQ 


ID 


NO: 


779) ; 


GTCTGTTGTG 


(SEQ 


ID 


NO: 


780) ; 


TCTGTTGTGT 


(SEQ 


ID 


NO: 


781) ; 


CTGTTGTGTG 


(SEQ 


ID 


NO: 


782) ; 


TGTTGTGTGA 


(SEQ 


ID 


NO: 


783) ; 


GTTGTGTGAC 


(SEQ 


ID 


NO: 


784) ; 


TTGTGTGACT 


(SEQ 


ID 


NO: 


785) ; 


TGTGTGACTC 


(SEQ 


ID 


NO: 


786) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 


787) ; 


TGTGTGACTC 


(SEQ 


ID 


NO: 


788) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 


789) ; 


TGTGACTCTG 


(SEQ 


ID 


NO: 


790) ; 


GTGACTCTGG 


(SEQ 


ID 


NO: 


791) ; 


TGACTCTGGT 


(SEQ 


ID 


NO: 


792) ; 


GACTCTGGTA 


(SEQ 


ID 


NO: 


793) ; 


ACTCTGGTAA 


(SEQ 


ID 


NO: 


794) ; 


CTCTGGTAAC 


(SEQ 


ID 


NO: 


795) ; 


TCTGGTAACT 


(SEQ 


ID 


NO: 


796) ; 


CTGGTAACTA 


(SEQ 


ID 


NO: 


797) ; 


TGGTAACTAG 


(SEQ 


ID 


NO: 


798) ; 


GGTAACTAGA 


(SEQ 


ID 


NO: 


799) . 



86. A method for determining the pathogenicity of an HIV-1 strain after said HIV-1 
strain infects cells of an individual, said method comprising determining the presence 
of a deletion mutation in its genome of at least 10 nucleotides within the region from 
nucleotide 8787 to nucleotide 9709 using the nucleotide numbering of HIV-1 strain 
NL4-3 wherein the presence of said deletion mutation is indicative of the presence of 
a non-pathogenic strain of HIV-1. 

87. A method according to claim 86 wherein said non-pathogenic HIV-1 carries a 
deletion in its genome of at least 10 nucleotides from within a region selected from the 
list consisting of: 



BNSDOCID: <WO 952l912AlJ_> „ 



WO 95/21912 PCT/AU95/00063 

- 239 - 



nucleotide (i) 8830-8862 
(ii) 9009-9035 



(iii) 9019-9029 

(iv) 9033-9049. 



and 



88. A method according to claim 86 wherein said non-pathogenic HIV-1 carries a 
deletion in its genome of at least 10 nucleotides from within a region selected from the 
list consisting of: 

nucleotide (v) 9281-9371; 

(vi) 9281-9362; 

(vii) 9105-9224; and 

(viii) 9271-9370. 

89. A method according to claim 86 wherein said non-pathogenic HIV-1 carries a 
deletion in its genome of at least 10 nucleotides from within a region selected from the 
list consisting of: 

nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 

90. A method according to claim 86 wherein said non-pathogenic HIV-1 carries a 
deletion in its genome of at least 10 nucleotides from within a region selected from the 
list consisting of: 

nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

91. A strain of HIV-1 according to claim 86 wherein said deletion encompasses at 
least one of the following decanucleotides from the nef gene of HIV-1 strain NL4-3: 
ATGGGTGGCA {SEQ ID NO: 2); TGGGTGGCAA (SEQ ID NO: 3); 
GGGTGGCAAG (SEQ ID NO: 4); GGTGGCAAGT ( SEQ ID NO: 5); 
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GTGGCAAGTG ( SEQ ID NO : 6 ) ; 

GGCAAGTGGT (SEQ ID NO: 8); 

CAAGTGGTCA (SEQ ID NO: 10); 

AGTGGTCAAA (SEQ ID NO: 12); 

TGGTCAAAAA ( SEQ ID NO : 14 ) ; 

GTCAAAAAGT (SEQ ID NO:16); 

CAAAAAGTAG (SEQ ID NO: 18); 

AAAAGTAGTG (SEQ ID NO: 20) ; 

AAGTAGTGTG ( SEQ ID NO : 2 2 ) ; 

GTAGTGTGAT (SEQ ID NO: 24); 

AGTGTGATTG (SEQ ID NO: 26); 

TGTGATTGGA (SEQ ID NO: 28); 

TGATTGGATG (SEQ ID NO: 30); 

ATTGGATGGC ( SEQ ID NO : 3 2 ) ; 

TGGATGGCCT (SEQ ID NO: 34); 

GATGGCCTGC ( SEQ ID NO : 3 6 ) ; 

TGGCCTGCTG (SEQ ID NO: 38); 

GCGTGCTGTA (SEQ ID NO:40); 

CTGCTGTAAG (SEQ ID NO: 42); 

GCTGTAAGGG (SEQ ID NO: 44) ; 

TGTAAGGGAA (SEQ ID NO : 4 6 ) ; 

TAAGGGAAAG ( SEQ ID NO : 4 8 ) ; 
AGGGAAAGAA (SEQ ID NO: 50); 
GGAAAGAATG (SEQ ID NO: 52); 
AAAGAATGAG (SEQ ID NO: 54); 
AGAATGAGAC (SEQ ID NO: 56); 
AATGAGACGA (SEQ ID NO: 58); 
TGAGACGAGC (SEQ ID NO: 60); 
AGACGAGCTG (SEQ ID NO: 62); 
ACGAGCTGAG (SEQ ID NO: 64); 
GAGCTGAGCC (SEQ ID NO: 66); 
GCTGAGCCAG (SEQ ID NO: 68); 
TGAGCCAGCA (SEQ ID NO: 70); 
AGCCAGCAGC (SEQ ID NO: 72); 
CCAGCAGCAG (SEQ ID NO:74); 
AGCAGCAGAT (SEQ ID NO: 76); 
CAGCAGATGG (SEQ ID NO: 78) ; 
GCAGATGGGG (SEQ ID NO: 80); 



TGGCAAGTGG (SEQ ID NO: 7) 



GCAAGTGGTC 


(SEQ 


ID NO 


:9) 




AAGTGGTCAA 


(SEQ 


ID 


NO 


ill) 


GTGGTCAAAA 


(SEQ 


ID 


NO 


:13) 


GGTCAAAAAG 


(SEQ 


ID 


NO 


:15) 


TCAAAAAGTA 


(SEQ 


ID 


NO 


:17) 


AAAAAGTAGT 


(SEQ 


ID NO 


:19) 




AAAGTAGTGT 


(SEQ 


ID 


NO 


21) 


AGTAGTGTGA 


(SEQ 


ID 


NO 


23) 


TAGTGTGATT 


(SEQ : 


ID NO 


:25) 




GTGTGATTGG 


(SEQ 


ID 


NO: 


27) 


GTGATTGGAT 


(SEQ 


ID 


NO: 


29) 


GATTGGATGG 


(SEQ 


ID 


NO: 


31) 


TTGGATGGCC 


(SEQ 


ID 


NO: 


33) 


GGATGGCCTG 


(SEQ 


ID 


NO: 


35) 


ATGGCCTGCT 


(SEQ 


ID 


NO: 


37) 


GGCCTGCTGT 


(SEQ 


ID 


NO: 


39) 


CCTGCTGTAA 


(SEQ 


ID 


NO: 


41) 


TGCTGTAAGG 


(SEQ 


ID 


NO: 


43) 


CTGTAAGGGA 


(SEQ 


ID 


NO: 


45) 


GTAAGGGAAA 


(SEQ 


ID 


NO: 


47) 


AAGGGAAAGA 


(SEQ 


ID 


NO: 


49) 


GGGAAAGAAT 


(SEQ 


ID 


NO: 


51) 


GAAAGAATGA 


(SEQ 


ID 


NO: 


53) 


AAGAATGAGA 


(SEQ 


ID 


NO: 


55) 


GAATGAGACG 


(SEQ 


ID 


NO: 


57) 


ATGAGACGAG 


(SEQ 


ID 


NO: 


59) 


GAGACGAGCT 


(SEQ 


ID 


NO: 


61) 


GACGAGCTGA 


(SEQ 


ID 


NO: 


63) 


CGAGCTGAGC 


(SEQ 


ID 


NO: 


65) 


AGCTGAGCCA 


(SEQ 


ID 


NO: 


67) 


CTGAGCCAGC 


(SEQ 


ID 


NO : 


69) 


GAGCCAGCAG 


(SEQ 


ID 


NO: 


71) 


GCCAGCAGCA 


(SEQ 


ID 


NO: 


73) 


CAGCAGCAGA 


(SEQ 


ID 


NO: 


75) 


GCAGCAGATG 


(SEQ 


ID 


NO: 


77) 


AGCAGATGGG 


(SEQ 


ID 


NO: 


79) 


CAGATGGGGT 


(SEQ 


ID 


NO: 


81) 
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AGATGGGGTG 


(SEQ 


ID 


NO : 


82 ) • 


GATGGGGTGG 


(SEO 


ID 


NO : 8 3 ) 


ATGGGGTGGG 


(SEO 

V ^ J - i w 


ID 


NO • 

xm w « 


84 ) * 




(SEO 


ID 


NO : 8 5 ) 


GGGGTGGGAG 


(SEO 


ID 

X XV 


NO • 


fifi ) • 
007 / 


GGGTGGGAG P 


(SEO 


TD 

X XV 


NO -87) 


GGTGGGAGPA 


( SPO 


X. XV 


NO • 


ft ft } ■ 


GTGGGAGPAG 


( SPO 


TD 

X LJ 


MO -89) 


TGGGAGPAGT 


( <^PO 


TD 
xxv 


TJO • 


q n ^ * 




(SEQ 


ID 


NO: 91) 


GGAGPAGTAT 




x. u 


mo • 


Q9l . 

-s I t 


G AGP APT ATP 


(SEQ 


ID 


NO: 93) 


AGCAGTATCT 


(SEO 


ID 


NO : 


■v*± y f 


GCAGTATCTC 


(SEQ 


ID 


NO: 95) 


CAGTATCTCG 


(SEQ 


ID 


NO : 


96) ; 


AGTATCTCGA 


(SEQ 


ID 


NO: 97) 


GTATCTCGAG 


(SEQ 


ID 


NO : 


98) ; 


TATCTCGAGA 


(SEQ 


ID 


NO: 99) 


ATCTCGAGAC 


(SEQ 


ID 


NO : 


100) ; 


TCTCGAGACC 


(SEQ 


ID NO:101) 


CTCGAGACCT 


(SEO 


ID 


NO : 


102) ; 


TPGAGACPTA 


(SEQ 


ID 


NO : 


103) ; 


PGAGAPPTAG 


/ CPA 

V o-tiy 


XJJ 


MO - 

INC - 




PAnAPPTAPA 


(SEQ 


ID 


NO : 


105) ; 


AbALL 1 AbM 


( obQ 


Tfv 

ID 


NO : 


10 6; ; 


CjAC C 1 AGj AAA 


( SPO 


TD 

X XV 


NO • 


X \J / / f 


AC 1 AO AAAA 


( SfcjQ 


ID 


NO : 


108 ) ; 


C CT AGAAAAA 


( QPO 


Tn 

X XV 


NO • 


X \J J ) t 






1JJ 




X1U) ; 




(SEO 


TD 

X. XV 


MO * 


111) - 
x x x ; , 


AGAAAA APAT 




TFi 
1U 




110V . 
xxz y , 


naAAAAPATH 
urittn/in^ri X O 


(SEO 


ID 

X XV 


NO • 

x^w ■ 


11^)* 
x. x. ._> ; f 


AAAAACATGG 




TD 

X. XV 


NO • 

XNV»V • 


114). 

x X*± / ^ 


AAAAPATGGA 


(SEQ 


ID 


NO : 


115) • 


AAACATGGAG 


(SEO 

\ 0 x_j\^ 


TD 

X> XV 


NO • 

XM vV • 


JL X U / ^ 


AAPATGGAGP 


(SEQ 


ID 


NO : 


117) ; 


APATGGAGPA 


( SPO 


in 

X XV 


MO • 


lift). 


PATGGAGPAA 


(SEQ 


ID 


NO : 


119) ; 


ATPftAPPA AT 


/ CTTO 


XJJ 


MO - 


iZU) , 


Trip a ftp a i\ t p 

1 wVjAijC>iA 1 


(SEQ 


ID 


NO : 


121) ; 


fiparra 7atpa 

oOAOCAAx CA 


V OxliVj 




JnU : 


xz Z ) ; 


PAPfA A TPA r" 1 

bAbUiAl LAL 


(SEQ 


ID 


NO : 


123 ) ; 


AbLAA 1 L-AL-A 


\ OjUiQ 


ID 


JNU : 


T *^ A \ - 

1^4; ; 


CjCAAx ualaa 


(SEQ 


ID 


NO : 


125) * 

X 4* J f 




(SEQ 


ID 




1^ o ; ; 


AA 1 UALAAb 1 


(SEQ 


Tn 

XXJ 


MO . 




■tt. X \~J\\~J\±\\j X .M. 




xu 


INC I 


lzo j ; 




( QPO 


ID 


NO: 


129) ; 






xu 


MPl • 


1 j u J ; 


APAAPTAftPA 


(SEQ 


ID 


NO: 


131) ; 




f QpD 


Tn 
X U 


MO • 




AAHTAftPAAT 


(SEQ 


ID 


NO: 


133) ; 


APTAPPA ATA 




X XV 


mo • 


1 \ . 
1 j*± / / 


HTAHPAATAP 

O X i~i\jr Livi X raw 


(SEQ 


ID 


NO: 


135) ; 


TAGPAATAPA 




Tn 

X XV 


mo ■ 

XNC • 


lJO / ; 


AftPAATAPAH 


(SEQ 


ID 


NO: 


137) ; 


GPAATAPAGP 






MO * 

XNC . 


1 ft ) • 


PA ATA P AGP A 


(SEQ 


ID 


NO: 


139) ; 


AATACAG PAG 


( SPO 


TD 

X XV 


MO * 


14 0)* 


ATAPAGPAGP 


(SEQ 


ID 


NO: 


141) ; 


TACAGCAGCT 


(SEQ 


TD 

X> XV 


NO • 


14 9) • 

X. *t f / 


APAGPAGPTA 


(SEQ 


ID 


NO: 


143) ; 


PAG PAG PT AA 


( QPO 


th 

X XV 


MO • 

XNVV » 


144) - 


AGPAGPTAAP 

nu v^ravJ V*- x xTtra\— 


(SEQ 


ID 


NO: 


145) ; 


oLAbL 1AALA 


( f~» XT' /^\ 

( bhiQ 


ID 


NO : 


14b) ; 


"A /""^ /^"Tt A A A A 

CJALiL i AALAA 


(SEQ 


ID 


NO: 


147) ; 


AGCTAACAAT 


(SEQ 


ID 


NO: 


148) ; 


GCTAACAATG 


(SEQ 


ID 


NO: 


149) ; 


CTAACAATGC 


(SEQ 


ID 


NO; 


150) ; 


TAACAATGCT 


(SEQ 


ID 


NO: 


151) ; 


AACAATGCTG 


(SEQ 


ID 


NO: 


152) ; 


ACAATGCTGC 


(SEQ 


ID 


NO: 


153) ; 


■ CAATGCTGCT 


(SEQ 


ID 


NO: 


154) ; 


AATGCTGCTT 


(SEQ 


ID 


NO: 


155) ; 


ATGCTGCTTG 


(SEQ 


ID 


NO: 


156) ; 


TGCTGCTTGT 


(SEQ 


ID 


NO: 


157) ; 
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GCTGCTTGTG 


(SEQ 


ID 


NO 


: 158) 


TGCTTGTGCC 


(SEQ 


ID 


NO 


: 160) 


CTTGTG CCTG 


(SEQ 


ID 


NO 


:162) 


TGTGCCTGGC 


(SEQ 


ID 


NO 


: 164) 


TGCCTGGCTA 


(SEQ 


ID 


NO 


:166) 


CCTGGCTAGA 


(SEQ 


ID 


NO 


:168) 


TGGCTAGAAG 


(SEQ 


ID 


NO 


:170) 


GCTAGAAGCA 


(SEQ 


ID 


NO 


:172) 


TAGAAGCACA 


(SEQ 


ID 


NO 


:174) 


GAAGCACAAG 


(SEQ 


ID 


NO 


:176) 


AGCACAAGAG 


(SEQ 


ID 


NO 


:178) 


CACAAGAGGA 


(SEQ 


ID 


NO 


:180) 


CAAGAGGAGG 


(SEQ 


ID 


NO 


:182) 


AGAGGAGGAA 


(SEQ 


ID 


NO 


:184) 


AGGAGGAAGA 


(SEQ 


ID 


NO 


:186) 


GAGGAAGAGG 


(SEQ 


ID 


NO 


188) 


GGAAGAGGTG 


(SEQ 


ID 


NO- 


190) 


AAGAGGTGGG 


(SEQ 


ID 


NO: 


192) 


GAGGTGGGTT 


(SEQ 


ID 


NO- 


194) 


GGTGGGTTTT 


(SEQ 


ID 


NO: 


196) 


TGGGTTTTCC 


(SEQ 


ID 


NO: 


198) 


GGTTTTCCAG 


(SEQ 


ID 


NO: 


200) 


TTTTCCAGTC 


(SEQ 


ID 


NO: 


202) 


TTCCAGTCAC 


(SEQ 


ID 


NO: 


204) 


CCAGTCACAC 


(SEQ 


ID 


NO: 


206) 


AGTGACACCT 


(SEQ 


ID 


NO: 


208) 


TCACACCTCA 


(SEQ 


ID 


NO. 


210) 


ACACCTCAGG 


(SEQ 


ID 


NO: 


212) 


ACCTCAGGTA 


(SEQ 


ID 


NO: 


214) 


CTCAGGTACC 


(SEQ 


ID 


NO: 


216) 


CAGGTACCTT 


(SEQ 


ID 


NO: 


218) 


GGTACCTTTA 


(SEQ 


ID 


NO: 


220) 


TACCTTTAAG 


(SEQ 


ID 


NO: 


222) 


CCTTTAAGAC 


(SEQ 


ID 


NO: 


224) 


TTTAAGACCA 


(SEQ 


ID 


NO: 


226) 


TAAGACCAAT 


(SEQ 


ID 


NO: 


228) 


AGACCAATGA 


(SEQ 


ID 


NO: 


230) 


AC CAATGACT 


(SEQ 


ID 


NO: 


232) 



242 - 



CTGCTTGTGC 


(SEQ 


ID 


NO 


.159) 


GCTTGTGCCT 


(SEQ 


ID 


NO 


161) 


TTGTGCCTGG 


(SEQ 


ID 


NO 


163) 


GTGCCTGGCT 


(SEQ 


ID 


NO. 


165) 


GCCTGGCTAG 


(SEQ 


ID 


NO: 


167) 


CTGGCTAGAA 


(SEQ 


ID 


NO. 


169) 


GGCTAGAAGC 


(SEQ 


ID 


NO 


171) 


CTAGAAGCAC 


(SEQ 


ID 


NO 


173) 


AGAAGCACAA 


(SEQ 


ID 


NO 


175) 


AAGCACAAGA 


(SEQ 


ID 


NO 


177) 


GCACAAGAGG 


(SEQ 


ID 


NO- 


179) 


ACAAGAGGAG 


(SEQ 


ID 


NO: 


181) 


AAGAGGAGGA 


(SEQ 


ID 


NO: 


183) 


GAGGAGGAAG 


(SEQ 


ID 


NO: 


185) 


GGAGGAAGAG 


(SEQ 


ID 


NO: 


187) 


AGGAAGAGGT 


(SEQ 


ID 


NO: 


189) 


GAAGAGGTGG 


(SEQ 


ID 


NO: 


191) 


AGAGGTGGGT 


(SEQ 


ID 


NO: 


193) 


AGGTGGGTTT 


(SEQ 


ID 


NO: 


195) 


GTGGGTTTTC 


(SEQ 


ID 


NO: 


197) 


GGGTTTT CCA 


(SEQ 


ID 


NO: 


199) 


GTTTTCCAGT 


(SEQ 


ID 


NO: 


201) 


TTTCCAGTCA 


(SEQ 


ID 


NO: 


203) 


TCCAGTCACA 


(SEQ 


ID 


NO: 


205) 


CAGTCACACC 


(SEQ 


ID 


NO: 


207) 


GTCACACCTC 


(SEQ 


ID 


NO: 


209) 


CACACCTCAG 


(SEQ 


ID 


NO: 


211) 


CACCTCAGGT 


(SEQ 


ID 


NO: 


213) 


CCTCAGGTAC 


(SEQ 


ID 


NO: 


215) 


TCAGGTACCT 


(SEQ 


ID 


NO: 


217) 


AGGTACCTTT 


(SEQ 


ID 


NO: 


219) 


GTACCTTTAA 


(SEQ 


ID 


NO: 


221) 


ACCTTTAAGA 


(SEQ 


ID 


NO: 


223) 


CTTTAAGACC 


(SEQ 


ID 


NO: 


225) 


TTAAGACCAA 


(SEQ 


ID 


NO 


227) 


AAGACCAATG 


(SEQ 


ID 


NO 


229) 


GACCAATGAC 


(SEQ 


ID 


NO 


:231) 


CCAATGACTT 


(SEQ 


ID 


NO 


:233) 
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P A A TP A fTT A 




XD 


IN LJ 


. Z j / 


AA X LxAL^ X X At 




TP, 

XD 


JNC 


: Z o d ; ; 






X u 


KTA 

Vi\J 




T'P a PTTA OA A 
X bAL X x AO AA 


/ CT?0 


T "Pi 

ID 


JNIU 


: z 6 I ) ; 


P A PTT A P A A P 
LxH.L^ 1 X AL^AtILj 


v ox^y 


TVS 
XXJ 






AO X X AOAALjO 


/ ct?o 


XD 


JNU 


: z j y ; ; 


pt t a p a a pp r* 

L- X X r\^r\f\\3\j L^ 




X xj 


IN LJ 


. Z ft U / / 


TTA PA APPP A 




Tn 
ID 




> z 4 X ; ; 


tapa apppap 




XXJ 


JN LJ 




A PA TS.HHHT^HH 
AL-AALjnjyOALjO 


^c,o 


XD 




z 4 6 ) ; 






Tn 

X LJ 


"MP 


» ^ *± *± / / 


A APPPAPPTP 
>i-M.LjLj ^ — H.Lj L- X Lj 


^ OXtiL^ 


Tn 

XXJ 




z ft d ; ; 


AGGCAGCTGT 


(SEO 


ID 


NO < 


246 ) ; 


GGCAGCTGTA 


(SEO 


ID 


NO : 


P4 7 ^ • 

" ■* * / /• 


GCAGCTGTAG 


(SEO 


ID 


NO : 


248) ; 


CAGCTGTAGA 


(SEO 


ID 


NO : 


24 9) ; 


AG CTGTAGAT 


(SEQ 


ID 


NO : 


250) ; 


GCTGTAGATC 


(SEQ 


ID 


NO : 


2 51) ; 


CTGTAGATCT 


(SEQ 


ID 


NO : 


252) ; 


TGT AG AT C T T 


(SEO 


ID 


NO: 


253) ; 


GTAGATCTTA 


(SEO 


ID 


NO : 


254) ; 


TAGATCTTAG 


(SEO 


ID 


NO: 


255) ; 


AGATPTTAPP 

rtVJri. lull nuv.. 




x u 


1NLJ . 


Z DO y , 


PATPTTAPPP 
LjAX O X X AvjOO 




Tn 

XD 


NO: 


257) ; 


AXL-X XAL?L^tJA 


(seq 


ID 


NO : 


o c o \ 

2 5 8; ; 


TCTTAGCCAC 


(SEQ 


ID 


NO: 


259) ; 


L. 1 1 AGLCACT 


(SEQ 


ID 


NO : 


2 60) ; 


TTAGCCACTT 


(SEQ 


ID 


NO: 


261) ; 


x ACjCCACTTT 


(SEQ 


ID 


NO : 


2 62) ; 


AGCCACTTTT 


(SEQ 


ID 


NO:263) ; 


uLLAL X X X X X 


/ nr»r\ 

voEQ 


XD 


NO : 


Z 6 4 ; ; 


CCACTTTTTA 


(SEQ 


ID 


NO: 


265) ; 


PA fTTTTTA a 
v — r\\- X XXX XAA 




T "Hi 
X D 


iNU : 




AO 1 x x x X AAA 


(SEQ 


XD 


NO: 


267) ; 


PTTTTTA AAA 
\— X X X X X jHJ-Lrxri 


v otpo. 


Tn 


inlj : 


zoo; ; 


X X X X XAAAAvjj 


/ Giro 


XD 


NO: 


269) ; 


TTTT A A A APA 




Tn 
xd 


MO * 


z / u ; ; 


rprprp A A A O A A 

1 X 1AAAAOAA 


(SbQ 


XD 


NO: 


271) ; 


TTA A A AP A A A 
X X AfiAAtj AAA 


/ oT?r\ 
\ oh*y 


XD 


viU : 


z /z ; ; 


TAAAAOAAAA 

TAAAAGAAAA 


(SEQ 


ID 


NO: 


273) ; 




/ PDA 

(SEQ 


XD 


JNU : 


2 74 ; ; 


AAAGAAAAGG 


(SEQ 


ID 


NO: 


275) ; 


a a o a a 7\ a oo/~* 
AALjAAAAGGG 


(SEQ 


ID 


NO : 


2 7 6) ; 


AGAAAAGGGG 


(SEQ 


ID 


NO: 


277) ; 


LxAAAAL3\jvjKj<j 


/ C?T70 

(SEQ 


ID 


NO : 


2 / o ) ; 


T\ 7\ 7\ 7\ /~1 /~i /~i /"I /*i 

AAAAGGGGGG 


(SEQ 


ID 


NO: 


279) ; 


AAAUCjVjCjviOA 


(SEQ 


ID 


NO : 


2 8 0; ; 


AAGGGGGGAC 


(SEQ 


ID 


NO: 


281) ; 


T\r^r % r^r^r^r^T\ ot^ 
ALjLiOLjvjLxAv- X 


IbhiQ 


ID 


NO : 


282 ; ; 


GGGGGGACTG 


(SEQ 


ID 


NO: 


283) ; 


LjLjLjLjLjAL. X VjVj 




Tn 
ID 


MO - 

NO : 


z 84 ; ; 


bbbbAt I OOA 


(SEQ 


ID 


NO: 


285) ; 


PPPA PTPP A A 




Tn 
xd 


MP • 

no : 


zoo; ; 


OO A PTPP A AO 
VjLj AO X uuAAu 


/ CPA 


XD 


NO: 


287) ; 


PA PTPP A Ann 




Tn 

XXJ 


MP . 

jnlj : 


O fl ft ^ . 

z d o ; ; 


AOTPOAAOOO 
AL- X OOAAOOO 




XD 


NO: 


289) ; 


PTPP A APPPP 


( qph 


Tn 
xd 


MP • 


z y u ; ; 


TPP A APPPPT 




XD 


NO: 


291) ; 


PP A APPPPT A 




Tn 
xd 


MP . 

no : 


zjz j ; 


PA AOOOOTA A 

OAAOOOO X AA 


/ OTTO 


XD 


NO: 


293) ; 


A APPPPT 1 A AT 1 
AALjLjLjL. 1 AAX 


IoEjQ 


XD 


NO : 


z y 4 ; ; 


AOOOOTA A Try 

AOOOOIAAl X 


(SEQ 


ID 


NO: 


295) ; 


PPr t / r,r P'A 7\ rprn*~% 

CjCjCjLIAAI it 


(SEQ 


TT"\ 

ID 


vr/*\ . 
NO : 


2 9 6; ; 


GGCTAATTCA 


(SEQ 


ID 


NO: 


297) ; 


GCTAATTCAC 


(SEQ 


ID 


NO: 


2 98) ; 


CTAATTCACT 


(SEQ 


ID 


NO: 


2 99) ; 


TAATTCACTC 


(SEQ 


ID 


NO: 


300) ; 


AATTCACTCC 


(SEQ 


ID 


NO: 


301) ; 


ATTCACTCCC 


(SEQ 


ID 


NO: 


302) ; 


TTCACTCCCA 


(SEQ 


ID 


NO: 


3 03) ; 


TCACTCCCAA 


(SEQ 


ID 


NO: 


3 04) ; 


CACTCCCAAA 


(SEQ 


ID 


NO: 


3 05) ; 


ACTCCCAAAG 


(SEQ 


ID 


NO: 


306) ; 


CTCCCAAAGA 


(SEQ 


ID 


NO: 


3 07) ; 


TCCCAAAGAA 


(SEQ 


ID 


NO: 


308) ; 


CCGAAAGAAG 


(SEQ 


ID 


NO: 


3 09) ; 
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/"I /~i Tv Tv t\ i—i t\ TV T\ 

C CAAAGAAGA 


(SEQ 


ID 


NO 


: 310) 


AAAUAAGACA 


(SEQ 


ID 


NO 


: 312 ) 


AGAAGACAAG 


(SEQ 


ID 


NO 


: 314 ) 


AAGACAAGAT 


(SEQ 


ID 


NO 


: 316) 


GACAAGATAT 


(SEQ 


ID 


NO 


: 318) 


C AAG AT AT C C 


(SEQ 


ID 


NO 


: 320) 


AG AT AT CCTT 


(SEQ 


ID 


NO 


: 322) 


ATATCCTTGA 


(SEQ 


ID 


NO 


: 324) 


ATCCTTGATC 


(SEQ 


ID 


NO 


: 326) 


CCTTGATCTG 


(SEQ 


ID 


NO 


:328) , 


TTGATCTGTG 


(SEQ 


ID 


NO 


:330) j 


GATCTGTGGA 


(SEQ 


ID 


NO 


:332) , 


T CTGTGG AT C 


(SEQ 


ID 


NO 


334) , 


TGTGGATCTA 


(SEQ 


ID 


NO: 


336) j 


TGGATCTACC 


(SEQ 


ID 


NO: 


338) ; 


GATCTACCAC 


(SEQ 


ID 


NO: 


340) ; 


TCTACCACAC 


(SEQ 


ID 


NO: 


342) ; 


TACCACACAC 


(SEQ 


ID 


NO: 


344) ; 


C CACACACAA 


(SEQ 


ID 


NO: 


346) ; 


ACACACAAGG 


(SEQ 


ID 


NO: 


348) ; 


ACACAAGGCT 


(SEQ 


ID 


NO: 


350) ; 


TV j^l TV TV >"*t ^» yi r*l TV A 

ACAAGGCTAC 


(SEQ 


ID 


NO: 


352) ; 


TV TV ^l^t j^m^ J 1 <t ■ *m 

AAGGCTACTT 


(SEQ 


ID 


NO: 


354) ; 


GGCTACTTCC 


(SEQ 


ID 


NO: 


3 56) ; 


CTACTTCCCT 


(SEQ 


ID 


NO: 


3 58) ; 


ACTTCCCTGA 


(SEQ 


ID 


NO: 


3 60) ; 


TTCCCTGATT 


(SEQ 


ID 


NO: 


362) ; 


CCCTGATTGG 


(SEQ 


ID 


NO: 


364) ; 


CTGATTGGCA 


(SEQ 


ID 


NO : 


366) ; 


G ATTGG CAG A 


(SEQ 


ID 


NO : 


3 68) ; 


1 1 v^GUAGAAC 


(SEQ 


ID 


NO : 


370) ; 


GGUAGAAC 1 A 


(SEQ 


ID 


NO : 


372 ) ; 


C AG AACJ T A C A 


(SEQ 


ID 


NO : 


374) ; 


bAAL I ACJACA 


(SEQ 


ID 


NO : 


376) ; 


ACTACACACC 


(SEQ 


ID 


NO: 


378) ; 


TACACACCAG 


(SEQ 


ID 


NO: 


380) / 


CACACCAGGG 


(SEQ 


ID 


NO: 


382) ; 


CACCAGGGCC 


(SEQ 


ID 


NO: 


384) ; 
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TV TV T\ /"^l TV TV S~1 TV 

CAAAGAAGAC 


(SEQ 


ID 


NO 


: 311) 


T\ T\ /~1 TV TV j^t TV /~1 TV TV 

AAGAAGACAA 


(SEQ 


ID 


NO 


: 313 ) 


/—I -Jv TV /—I TV /"( TV TV /**! TV 

GAAGACAAGA 


(SEQ 


ID 


NO 


: 315 ) 


AGACAAGATA 


(SEQ 


ID 


NO 


: 317) 


ACAAGATATC 


(SEQ 


ID 


NO 


: 319) 


tv ti tv m tv rn /^r <^~im 

AAG AT AT C C T 


(SEQ 


ID 


NO 


: 321) 


GATATCCTTG 


(SEQ 


ID 


NO 


: 323 ) 


m t\ rrr m m n tv m 

TATCCTTGAT 


(SEQ 


ID 


NO 


: 325) 


TCCTTGATCT 


(SEQ 


ID 


NO 


: 327) 


CTTGATCTGT 


(SEQ 


ID 


NO 


: 329) 


TGATCTGTGG 


(SEQ 


ID 


NO 


: 331) 


ATCTGTGGAT 


(SEQ 


ID 


NO 


:333) 


CTGTGGATCT 


(SEQ 


ID 


NO 


: 335) 


GTGGATCTAC 


(SEQ 


ID 


NO 


:337) 


GGATCTACCA 


(SEQ 


ID 


NO 


.339) 


ATCTACCACA 


(SEQ 


ID 


NO 


:341) 


CTACCACACA 


(SEQ 


ID 


NO 


343) 


ACCACACACA 


(SEQ 


ID 


NO 


345) 


CACACACAAG 


(SEQ 


ID 


NO. 


347) 


CACACAAGGC 


(SEQ 


ID 


NO: 


349) 


CACAAGGCTA 


(SEQ 


ID 


NO: 


351) 


CAAGGCTACT 


(SEQ 


ID 


NO: 


353) 


AGGCTACTTC 


(SEQ 


ID 


NO: 


355) 


GCTACTTCCC 


(SEQ 


ID 


NO: 


357) 


TACTTCCCTG 


(SEQ 


ID 


NO: 


359) 


CTTCCCTGAT 


(SEQ 


ID 


NO: 


361) 


TCCCTGATTG 


(SEQ 


ID 


NO: 


363) 


CCTGATTGGC 


(SEQ 


ID 


NO: 


365) 


TGATTGGCAG 


(SEQ 


ID 


NO : 


367) 


t\ mm/*iri/*tT. /— < tv tv 

ATTGG CAGAA 


(SEQ 


ID 


NO : 


369) 


rri/^ /"r /"r tv tv TV /uri 

TGGCAGAACT 


(SEQ 


ID 


NO : 


371) 


/~i tv tv /~i m TV /~t 

GCAGAAC TAC 


(SEQ 


ID 


NO : 


373) 


TV /-« TV TV /-I m 7V /~1T\ /^t 

AGAAC T ACAC 


(SEQ 


ID 


NO : 


375) 


TV TV /^m TV TV TV /^t 

AACTACACAC 


(SEQ 


ID 


NO: 


377) 


CTACACACCA 


(SEQ 


ID 


NO: 


379) 


ACACACCAGG 


(SEQ 


ID 


NO: 


381) 


ACACCAGGGC 


(SEQ 


ID 


NO: 


3 83) 


AC CAGGGC CA 


(SEQ 


ID 


NO: 


385) 
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/ ceo 


t pi 
1U 




: Job) ; 




(SEQ 


ID 


NO 


*1 O *~J \ 

: 3 8 7 ) ; 




/ cpn 


yn 
1JJ 




! j(3o j ; 




( SEQ 


ID 


NO 


: 3 8 9 ; ; 




f CT?0 
V OxL\^ 


X u 


NTH 


. -3 Q pi \ . 


PPPAPPPPTP 


(SEQ 


ID 


MA 

NO 


: 3 9 1 ) ; 




( qpn 




vi\J 


. TOO \ - 


\^J\\j\j\j\j 1 LAb 


/ OTA 

(SEQ 


ID 


NO 


*3 n "5 \ 

: 3 9 3 / ; 






X u 




, OQA \ . 

! jjft 


bbbb X L-A.CjA I 


(bbQ 


ID 


NO 


c> r~ \ 

: 3 9 5 ) ; 


GGGTCAGATA 


V O X_i^ 




"MO 






/ ceo 


ID 


TATA 


: jy / ] ; 


GTCAGATATC 


(SEO 


ID 


NO 




TrAGATATrr 




x u 




. 3 J J J f 


CAGATATCCA 


(SEQ 


ID 


NO 


: 4 0 0 ) ; 


AGATATCCAC 


(SEO 


ID 


NO 


^ U X j ^ 


GATATCCACT 


(SEO 


ID 


NO 


: 4 0 2 ) ; 


ATATCCACTG 


(SEO 


TD 

JL i-J 


NO 


*± \J o J t 


TATCCACTGA 


(SEO 


ID 


NO 




ATfTAfTCflr 

ill ^ v-wrl> — X \Jrl\^ 




Tn 

X u 


XN\»/ 


ft U D / , 


TCCACTCl&CC 


f qtto 


X JJ 


Mp 


, /t pi \ . 




/ OCA 

(SEQ 


ID 


MA 
NO , 


4 07) ; 


P 1 Z\ PTP A PPTT 
OIL- 1 (jAtL 1 1 


\ ohy 


ID 


NO 


: 4 0 8 ; ; 


ACTGACCTTT 


(SEQ 


ID 


NO : 


4 09) ; 


U 1 bALL J. 1 lb 


(SEQ 


ID 


NO 


: 4 1 0 ) ; 


TGACCTTTGG 


(SEQ 


ID 


NO : 


411) ; 


bACCT I TGGA 


(SEQ 


ID 


NO 


: 412 ) ; 


ACCTTTGGAT 


(SEQ 


ID 


NO: 


413) ; 


UL1 x ICjOAICj 


/ PDA 

(SEQ 


ID 


NO : 


414 ) ; 


CTTTGGATGG 


(SEQ 


ID 


NO : 


415) ; 


T" i " van z\ t/7pt 




ID 


WU : 


4 lb J ; 


TTGGATGGTG 


(SEQ 


ID 


NO : 


417) ; 


TGG A TGHTG P 

X VJVJ X VJVJ X Vj V_ 


f Qpo 


X XJ 




/ 1 Q\ . 

4lo j ; 


r*C* A f PPP r PP /^~irp 

CjLjA 1 Ubr I (jjC X 


(SEQ 


ID 


XT A ^ 

NO : 


419) ; 


\Jr.f1 X \J\J X VJ\w X .rt. 




TTi 
X JJ 


MO - 


a o n ^ - 
U } ; 


A 1 bVj 1 bL 1 AC 


(SEQ 


ID 


MA - 

NO : 


4 21) ; 


X uu X uL X>^\_>1 






InU : 


*±<£Z ) ; 


GGTGCTACAA 


(SEQ 


ID 


XT A 

NO : 


42 3) ; 


ftTHPTAPA ZVP 
Vjr X X 


( QT?p\ 


ID 


J>JU : 


4 24; ; 


TG C T ACAAG C 


(SEQ 


ID 


NO : 


425) ; 


bLlALAAbL X 


(JbttAJ 


1U 


NO : 


4 2 6) ; 


CTACAAGCTA 


(SEQ 


ID 


NO : 


42 7) ; 


TAPRR opt A P 1 




ID 


NO : 


4 28) ; 


ACAAGCTAGT 


(SEQ 


ID 


NO : 


42 9) ; 


PLAP,PTZir!Ta 
UHHIjL 1 Ab X A 




ID 


XMU : 


4 3 0 ) ; 


AAGCTAGTAC 


(SEQ 


ID 


NO : 


431) ; 


arzr*Tar ,r ra p'P 1 




ID 


NU : 


4 32) ; 


GCTAGTACCA 


(SEQ 


ID 


NO : 


43 3) ; 


C X Abj 1 ACb-Atj 


(Shy 


ID 


NO : 


A *^ A \ 

434) ; 


T AGT AC CAGT 


(SEQ 


ID 


NO : 


435) ; 


Abj lALUAbl 1 


( biiQ 


ID 


NO : 


4 3 6) ; 


GTACCAGTTG 


(SEQ 


ID 


NO : 


43 7) ; 


xAuUAbix IbA 


(SEQ 


ID 


NO : 


43 8); 


ACCAGTTGAG 


(SEQ 


ID 


NO: 


439) ; 


AAA ATT'A 7\ a A 

C LAG TTGAG C 


(SEQ 


ID 


NO : 


440) ; 


CAGTTGAGCC 


(SEQ 


ID 


NO: 


441) ; 


7V A mmri t\ AAA T\ 

AGTTGAGCCA 


(SEQ 


ID 


NO : 


442) ; 


GTTGAGCCAG 


(SEQ 


ID 


NO: 


443) ; 


TTGAGCCAGA 


(SEQ 


ID 


NO : 


444) ; 


TGAGCCAGAT 


(SEQ 


ID 


NO: 


44 5) ; 


GAGC CAG ATA 


(SEQ 


ID 


NO : 


446) ; 


AGCCAGATAA 


(SEQ 


ID 


NO: 


447) ; 


<a LAOA I AAG 


(SEQ 


ID 


NO : 


44 8) ; 


CCAGATAAGG 


(SEQ 


ID 


NO: 


44 9) ; 


P* 7\ P 1 7A T7\ 7\ ppm 
LAbrA 1 AALjbr X 


/ o XT' A 

( SEQ 


ID 


NO : 


4 50) ; 


AGATAAGGTA 


(SEQ 


ID 


NO: 


451) ; 


GATAAGGTAG 


(SEQ 


ID 


NO: 


452) ; 


. ATAAGGTAGA 


(SEQ 


ID 


NO: 


453) ; 


TAAGGTAGAA 


(SEQ 


ID 


NO: 


454) ; 


AAGGTAGAAG 


(SEQ 


ID 


NO: 


455) ; 


AGGTAGAAGA 


(SEQ 


ID 


NO: 


456) ; 


GGTAGAAGAG 


(SEQ 


ID 


NO: 


457) ; 


GTAGAAGAGG 


(SEQ 


ID 


NO: 


458) ; 


TAGAAGAGGC 


(SEQ 


ID 


NO: 


459) ; 


AGAAGAGGCC 


(SEQ 


ID 


NO: 


460) ; 


GAAGAGGCCA 


(SEQ 


ID 


NO: 


461) ; 
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AAGAGGCCAA 


(SEQ 


ID 


NO 


•462) ; 


AGAGGCCAAT 


(SEQ 


ID 


NO: 


463 ) ; 


GAGG CCAATA 


(SEQ 


ID 


NO 


4 64) ; 


AGGCCAATAA 


(SEQ 


ID 


NO; 


465) ; 


GGCCAATAAA 


(SEQ 


ID 


NO 


466) ; 


GCCAATAAAG 


(SEQ 


ID 


NO: 


467) ; 


CCAATAAAGG 


(SEQ 


ID 


NO: 


468) ; 


CAATAAAGGA 


(SEQ 


ID 


NO: 


469) ; 


AATAAAGGAG 


(SEQ 


ID 


NO: 


470) ; 


ATAAAGGAGA 


(SEQ 


ID 


NO: 


471) ; 


TAAAGGAGAG 


(SEQ 


ID 


NO: 


4 72) ; 


AAAGGAGAGA 


(SEQ 


ID 


NO: 


473) ; 


AAGGAGAGAA 


(SEQ 


ID 


NO: 


4 74) ; 


AGGAGAGAAC 


(SEQ 


ID 


NO: 


475) ; 


GGAGAGAACA 


(SEQ 


ID 


NO: 


476) ; 


GAGAGAACAC 


(SEQ 


ID 


NO: 


477) ; 


AGAGAACACC 


(SEQ 


ID 


NO: 


478) ; 


GAGAACACGA 


(SEQ 


ID 


NO: 


479) ; 


AG AACAC CAG 


(SEQ 


ID 


NO: 


480) ; 


GAACACCAGC 


(SEQ 


ID 


NO: 


481) ; 


AACACCAGCT 


(SEQ 


ID 


NO: 


4 82) ; 


ACACCAGCTT 


(SEQ 


ID 


NO: 


4 8 3) ; 


CACCAGCTTG 


(SEQ 


ID 


NO: 


4 84) ; 


ACCAGCTTGT 


(SEQ 


ID 


NO: 


485) ; 


CCAGCTTGTT 


(SEQ 


ID 


NO: 


486) ; 


CAGCTTGTTA 


(SEQ 


ID 


NO: 


487) ; 


AGCTTGTTAC 


(SEQ 


ID 


NO: 


488) ; 


GCTTGTTACA 


(SEQ 


ID 


NO: 


489) ; 


CTTGTTACAC 


(SEQ 


ID 


NO: 


490) ; 


TTGTTACACC 


(SEQ 


ID 


NO: 


491) ; 


TGTTACACCC 


(SEQ 


ID 


NO: 


492) ; 


GTTACACCCT 


(SEQ 


ID 


NO: 


493) ; 


TTACACCCTG 


(SEQ 


ID 


NO: 


4 94) ; 


TACACCCTGT 


(SEQ 


ID 


NO: 


495) ; 


ACACCCTGTG 


(SEQ 


ID 


NO: 


496) ; 


CACCCTGTGA 


(SEQ 


ID 


NO: 


497) ; 


ACCCTGTGAG 


(SEQ 


ID 


NO: 


498) ; 


CCCTGTGAGC 


(SEQ 


ID 


NO: 


499) ; 


CCTGTGAGCC 


(SEQ 


ID 


NO: 


500) ; 


CTGTGAGCCT 


(SEQ 


ID 


NO: 


501) ; 


TGTGAGCCTG 


(SEQ 


ID 


NO: 


502) ; 


GTGAGCCTGC 


(SEQ 


ID 


NO: 


503) ; 


TGAGCCTGCA 


(SEQ 


ID 


NO: 


504) ; 


GAGCCTGCAT 


(SEQ 


ID 


NO: 


505) ; 


AGCCTGCATG 


(SEQ 


ID 


NO: 


506) ; 


GCCTGCATGG 


(SEQ 


ID 


NO: 


507) ; 


CCTGCATGGA 


(SEQ 


ID 


NO: 


508) ; 


CTGCATGGAA 


(SEQ 


ID 


NO: 


509) ; 


TGCATGGAAT 


(SEQ 


ID 


NO: 


510) ; 


GCATGGAATG 


(SEQ 


ID 


NO: 


511) ; 


CATGGAATGG 


(SEQ 


ID 


NO: 


512) ; 


ATGGAATGGA 


(SEQ 


ID 


NO: 


513) ; 


TGGAATGGAT 


(SEQ 


ID 


NO: 


514) ; 


GGAATGGATG 


(SEQ 


ID 


NO: 


515) ; 


GAATGGATGA 


(SEQ 


ID 


NO: 


516) ; 


AATGGATGAC 


(SEQ 


ID 


NO: 


517) ; 


ATGGATGACC 


(SEQ 


ID 


NO: 


518) ; 


TGGATGACCC 


(SEQ 


ID 


NO: 


519) ; 


GGATGACCCT 


(SEQ 


ID 


NO: 


520) ; 


GATGACCCTG 


(SEQ 


ID 


NO: 


521) ; 


ATGACCCTGA 


(SEQ 


ID 


NO: 


522) ; 


TGACCCTGAG 


(SEQ 


ID 


NO: 


523) ; 


GACCCTGAGA 


(SEQ 


ID 


NO: 


524) ; 


ACCCTGAGAG 


(SEQ 


ID 


NO: 


525) ; 


C C CTGAGAGA 


(SEQ 


ID 


NO: 


526) ; 


C CTGAGAG AG 


(SEQ 


ID 


NO: 


527) ; 


CTGAGAGAGA 


(SEQ 


ID 


NO : 


528) ; 


TGAGAGAGAA 


(SEQ 


ID 


NO: 


529) ; 


GAGAGAGAAG 


(SEQ 


ID 


NO: 


530) ; 


AGAGAGAAGT 


(SEQ 


ID 


NO: 


531) ; 


GAGAGAAGTG 


(SEQ 


ID 


NO: 


532) ; 


AGAGAAGTGT 


(SEQ 


ID 


NO: 


533) ; 


GAGAAGTGTT 


(SEQ 


ID 


NO: 


534) ; 


AGAAGTGTTA 


(SEQ 


ID 


NO: 


535) ; 


GAAGTGTTAG 


(SEQ 


ID 


NO: 


536) ; 


AAGTGTTAGA 


(SEQ 


ID 


NO: 


537) ; 
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AGTGTTAGAG 


(SEQ 


ID 


NO 


:538) 


TGTTAGAGTG 


(SEQ 


ID 


NO 


:540) 


TTAGAGTGGA 


(SEQ 


ID 


NO 


:542) 


AGAGTGGAGG 


(SEQ 


ID 


NO 


:544) 


AGTGGAGGTT 


(SEQ 


ID 


NO 


:546) 


TGGAGGTTTG 


(SEQ 


ID 


NO 


•548) 


GAGGTTTGAC 


(SEQ 


ID 


NO- 


550) 


GGTTTGACAG 


(SEQ 


ID 


NO: 


552) 


TTTGACAGCC 


(SEQ 


ID 


NO: 


554) 


TGACAGCCGC 


(SEQ 


ID 


NO: 


556) 


ACAGCCGCCT 


(SEQ 


ID 


NO: 


558) 


AGCCGCCTAG 


(SEQ 


ID 


NO: 


560) 


CCGCCTAGCA 


(SEQ 


ID 


NO: 


562) 


GCCTAGCATT 


(SEQ 


ID 


NO: 


564) 


CTAGCATTTC 


(SEQ 


ID 


NO: 


566) 


AGCATTTCAT 


(SEQ 


ID 


NO: 


568) 


CATTTCATCA 


(SEQ 


ID 


NO: 


570) 


TTTCATCACG 


(SEQ 


ID 


NO: 


572) 


TCATCACGTG 


(SEQ 


ID 


NO: 


574) 


ATCACGTGGC 


(SEQ 


ID 


NO: 


576) 


CACGTGGCCC 


(SEQ 


ID 


NO : 


578) 


GGTGGCCCGA 


(SEQ 


ID 


NO: 


580) 


TGGCCCGAGA 


(SEQ 


ID 


NO: 


582) 


GCCCGAGAGC 


(SEQ 


ID 


NO: 


584) 


CCGAGAGCTG 


(SEQ 


ID 


NO: 


586) 


GAGAGCTGCA 


(SEQ 


ID 


NO: 


588) 


GAGCTGCATC 


(SEQ 


ID 


NO: 


590) 


GCTGCATCCG 


(SEQ 


ID 


NO: 


592) 


TGCATCCGGA 


(SEQ 


ID 


NO : 


594) 


CATCCGGAGT 


(SEQ 


ID 


NO: 


596) 


TCCGGAGTAC 


(SEQ 


ID 


NO: 


598) 


CGGAGTACTT 


(SEQ 


ID 


NO: 


600) 


GAGTACTTCA 


(SEQ 


ID 


NO: 


602) 


GTACTTCAAG 


(SEQ 


ID 


NO: 


604) 


ACTTCAAGAA 


(SEQ 


ID 


NO: 


606) 


TTCAAGAACT 


(SEQ 


ID 


NO: 


608) 


CAAGAACTGC 


(SEQ 


ID 


NO: 


610) 


AGAACTGCTG 


(SEQ 


ID 


NO: 


612) 



GTGTTAGAGT 


(SEQ 


ID 


NO 


:539) ; 


GTTAGAGTGG 


(SEQ 


ID 


NO 


:541) ; 


TAGAGTGGAG 


(SEQ 


ID 


NO 


:543) ; 


GAGTGGAGGT 


(SEQ 


ID 


NO 


:545) ; 


GTGGAGGTTT 


(SEQ 


ID 


NO 


:547) ; 


GGAGGTTTGA 


(SEQ 


ID 


NO 


:549) ; 


AGGTTTGACA 


(SEQ 


ID 


NO 


:551) ; 


GTTTGACAGC 


(SEQ 


ID 


NO 


:553) ; 


TTGACAGCCG 


(SEQ 


ID 


NO 


:555) ; 


GACAGCCGCC 


(SEQ 


ID 


NO 


:557) ; 


CAGCCGCCTA 


(SEQ 


ID 


NO 


559) ; 


GCCGCCTAGC 


(SEQ 


ID 


NO 


561) ; 


CGCCTAGCAT 


(SEQ 


ID 


NO 


563) ; 


CCTAGCATTT 


(SEQ 


ID 


NO: 


565) ; 


TAGCATTTCA 


(SEQ 


ID 


NO: 


567) ; 


GCATTTCATC 


(SEQ 


ID 


NO: 


569) ; 


ATTTCATCAC 


(SEQ 


ID 


NO: 


571) ; 


TTCATCACGT 


(SEQ 


ID 


NO: 


573) ; 


CATCACGTGG 


(SEQ 


ID 


NO: 


575) ; 


TCACGTGGCC 


(SEQ 


ID 


NO: 


577) ; 


ACGTGGCCCG 


(SEQ 


ID 


NO: 


579) ; 


GTGGCCCGAG 


(SEQ 


ID 


NO: 


581) ; 


GGCCCGAGAG 


(SEQ 


ID 


NO: 


583) ; 


CCCGAGAGCT 


(SEQ 


ID 


NO: 


585) ; 


CGAGAGCTGC 


(SEQ 


ID 


NO: 


587) ; 


AGAGCTGCAT 


(SEQ 


ID 


NO: 


589) ; 


AGCTGCATCC 


(SEQ 


ID 


NO: 


591) ; 


CTGCATCCGG 


(SEQ 


ID 


NO: 


593) ; 


GCATCCGGAG 


(SEQ 


ID 


NO: 


595) ; 


ATCCGGAGTA 


(SEQ 


ID 


NO: 


597) ; 


CCGGAGTACT 


(SEQ 


ID 


NO: 


599) ; 


GGAGTACTTC 


(SEQ 


ID 


NO: 


601) ; 


AGTACTTCAA 


(SEQ 


ID 


NO: 


603) ; 


TACTTCAAGA 


(SEQ 


ID 


NO. 


605) ; 


CTTCAAGAAC 


(SEQ 


ID 


NO- 


607) ; 


TCAAGAACTG 


(SEQ 


ID 


NO 


609) ; 


AAGAACTGCT 


(SEQ 


ID 


NO. 


611) ; 


GAACTGCTGA 


(SEQ 


ID 


NO: 


613) . 
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92. A method according to claim 86 wherein said deletion encompasses at least one 
of the following decanucleotides from the LTR region of HIV-1 strain NL4-3: 



GCTTTTTGCC 


(SEQ 


ID 


NO: 


652) ; 


CTTTTTGCCT 


(SEQ 


ID 


NO: 


653 ) ; 


TTTTTGCCTG 


(SEQ 


ID 


NO: 


654) ; 


TTTTGCCTGT 


(SEQ 


ID 


NO: 


655) ; 


TTTGCCTGTA 


(SEQ 


ID 


NO: 


656) ; 


TTGCCTGTAC 


(SEQ 


ID 


NO: 


657) ; 


TGCCTGTACT 


(SEQ 


ID 


NO: 


658) ; 


GCCTGTACTG 


(SEQ 


ID 


NO: 


659) ; 


CCTGTACTGG 


(SEQ 


ID 


NO: 


660) ; 


CTGTACTGGG 


(SEQ 


ID 


NO: 


661) ; 


TGTACTGGGT 


(SEQ 


ID 


NO: 


662) ; 


GTACTGGGTC 


(SEQ 


ID 


NO: 


663) ; 


TACTGGGTCT 


(SEQ 


ID 


NO: 


664) ; 


ACTGGGT CTC 


(SEQ 


ID 


NO: 


665) ; 


CTGGGTCTCT 


(SEQ 


ID 


NO: 


666) ; 


TGGGTCTCTC 


(SEQ 


ID 


NO: 


667) ; 


GGGTCTCTCT 


(SEQ 


ID 


NO: 


668) ; 


GGTCTCTCTG 


(SEQ 


ID 


NO: 


669) ; 


GTCTCTCTGG 


(SEQ 


ID 


NO: 


670) ; 


TCTCTCTGGT 


(SEQ 


ID 


NO: 


671) ; 


CTCTCTGGTT 


(SEQ 


ID 


NO: 


672) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


673) ; 


CTCTGGTTAG 


(SEQ 


ID 


NO: 


674) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


675) ; 


CTGGTTAGAC 


(SEQ 


ID 


NO: 


676) ; 


TGGTTAGACC 


(SEQ 


ID 


NO: 


677) ; 


GGTTAGACCA 


(SEQ 


ID 


NO: 


678) ; 


GTTAG AC CAG 


(SEQ 


ID 


NO: 


679) ; 


TTAGACCAGA 


(SEQ 


ID 


NO: 


680) ; 


TAGACCAGAT 


(SEQ 


ID 


NO: 


681) ; 


AGACCAGATC 


(SEQ 


ID 


NO: 


682) ; 


GACCAGATCT 


(SEQ 


ID 


NO: 


683) ; 


ACCAGATCTG 


(SEQ 


ID 


NO: 


684) ; 


CCAGATCTGA 


(SEQ 


ID 


NO: 


685) ; 


CAGATCTGAG 


(SEQ 


ID 


NO: 


686) ; 


AGATCTGAGC 


(SEQ 


ID 


NO: 


687) ; 


GATCTGAGCC 


(SEQ 


ID 


NO: 


688) ; 


ATCTGAGCCT 


(SEQ 


ID 


NO: 


689) ; 


TCTGAGCCTG 


(SEQ 


ID 


NO: 


690) ; 


CTGAGCCTGG 


(SEQ 


ID 


NO: 


691) ; 


TGAGCCTGGG 


(SEQ 


ID 


NO: 


692) ; 


GAGCCTGGGA 


(SEQ 


ID 


NO: 


6 93) ; 


AGCCTGGGAG 


(SEQ 


ID 


NO: 


694) ; 


GCCTGGGAGC 


(SEQ 


ID 


NO: 


695) ; 


CCTGGGAGCT 


(SEQ 


ID 


NO: 


696) ; 


CTGGGAGCTC 


(SEQ 


ID 


NO: 


697) ; 


TGGGAGCTCT 


(SEQ 


ID 


NO: 


698) ; 


GGGAGCTCTC 


(SEQ 


ID 


NO: 


699) ; 


GGAGCTCTCT 


(SEQ 


ID 


NO: 


700) ; 


GAGCTCTCTG 


(SEQ 


ID 


NO: 


701) ; 


AGCTCTCTGG 


(SEQ 


ID 


NO: 


702) ; 


GCTCTCTGGC 


(SEQ 


ID 


NO: 


703) ; 


CTCTCTGGCT 


(SEQ 


ID 


NO: 


704) ; 


TCTCTGGCTA 


(SEQ 


ID 


NO: 


705) ; 


CTCTGGCTAA 


(SEQ 


ID 


NO: 


706) ; 


TCTGGCTAAC 


(SEQ 


ID 


NO: 


707) ; 


CTGGCTAACT 


(SEQ 


ID 


NO: 


708) ; 


TGGCTAACTA 


(SEQ 


ID 


NO: 


709) ; 


GGCTAACTAG 


(SEQ 


ID 


NO: 


710) ; 


GCTAACTAGG 


(SEQ 


ID 


NO: 


711) ; 


CTAACTAGGG 


(SEQ 


ID 


NO: 


712) ; 


TAACTAGGGA 


(SEQ 


ID 


NO: 


713) ; 


AACTAGGGAA 


(SEQ 


ID 


NO: 


714) ; 


ACTAGGGAAC 


(SEQ 


ID 


NO: 


715) ; 


GTAGGGAACC 


(SEQ 


ID 


NO: 


716) ; 


TAGGGAACCC 


(SEQ 


ID 


NO: 


717) ; 


AGGGAACCCA 


(SEQ 


ID 


NO: 


718) ; 


GGGAACCCAC 


(SEQ 


ID 


NO: 


719) ; 
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GGAACCCACT 


(SEQ 


ID 


NO: 


720) ; 


GAACCCACTG 


(SEQ 


ID 


NO: 


721) ; 


AACCCACTGC 


(SEQ 


ID 


NO: 


722) ; 


ACCCACTGCT 


(SEQ 


ID 


NO: 


723) ; 


CCCACTGCTT 


(SEQ 


ID 


NO: 


724) ; 


CCACTGCTTA 


(SEQ 


ID 


NO: 


725) ; 


CACTGCTTAA 


(SEQ 


ID 


NO: 


726) ; 


ACTGCTTAAG 


(SEQ 


ID 


NO: 


72 7) ; 


CTGCTTAAGC 


(SEQ 


ID 


NO: 


728) ; 


TGCTTAAGCC 


(SEQ 


ID 


NO: 


729) ; 


GCTTAAGCCT 


(SEQ 


ID 


NO: 


730) ; 


CTTAAGCCTC 


(SEQ 


ID 


NO: 


731) ; 


TTAAGCCTCA 


(SEQ 


ID 


NO: 


732) ; 


TAAGCCTCAA 


(SEQ 


ID 


NO: 


733) ; 


AAGCCTCAAT 


(SEQ 


ID 


NO: 


734) ; 


AGCCTCAATA 


(SEQ 


ID 


NO: 


735) ; 


GCCTCAATAA 


(SEQ 


ID 


NO: 


736) ; 


CCTCAATAAA 


(SEQ 


ID 


NO: 


737) ; 


CTCAATAAAG 


(SEQ 


ID 


NO: 


73 8) ; 


TCAATAAAGC 


(SEQ 


ID 


NO: 


73 9) ; 


CAATAAAGCT 


(SEQ 


ID 


NO: 


740) ; 


AATAAAGCTT 


(SEQ 


ID 


NO: 


741) ; 


ATAAAGCTTG 


(SEQ 


ID 


NO: 


742) ; 


TAAAGCTTGC 


(SEQ 


ID 


NO: 


743) ; 


AAAGCTTGCC 


(SEQ 


ID 


NO: 


744) ; 


AAGCTTGCCT 


(SEQ 


ID 


NO: 


745) ; 


AGCTTGCCTT 


(SEQ 


ID 


NO: 


746) ; 


GCTTGCCTTG 


(SEQ 


ID 


NO: 


747) ; 


GTTGCCTTGA 


(SEQ 


ID 


NO: 


748) ; 


TTGCCTTGAG 


(SEQ 


ID 


NO: 


749) ; 


TGCCTTGAGT 


(SEQ 


ID 


NO: 


750) ; 


GCCTTGAGTG 


(SEQ 


ID 


NO: 


751) ; 


CCTTGAGTGC 


(SEQ 


ID 


NO: 


752) ; 


CTTGAGTGCT 


(SEQ 


ID 


NO: 


753) ; 


TTGAGTGCTT 


(SEQ 


ID 


NO: 


754) ; 


TGAGTGCTTC 


(SEQ 


ID 


NO: 


755) ; 


GAGTGCTTCA 


(SEQ 


ID 


NO: 


756) ; 


AGTGCTTCAA 


(SEQ 


ID 


NO: 


757) ; 


GTGCTTCAAG 


(SEQ 


ID 


NO: 


758) ; 


TGCTTCAAGT 


(SEQ 


ID 


NO: 


759) ; 


GCTTCAAGTA 


(SEQ 


ID 


NO: 


760) ; 


CTTCAAGTAG 


(SEQ 


ID 


NO: 


761) ; 


TTCAAGTAGT 


(SEQ 


ID 


NO: 


762) ; 


TCAAGTAGTG 


(SEQ 


ID 


NO: 


763) ; 


CAAGTAGTGT 


(SEQ 


ID 


NO: 


764) ; 


AAGTAGTGTG 


(SEQ 


ID 


NO: 


765) ; 


AGTAGTGTGT 


(SEQ 


ID 


NO: 


766) ; 


GTAGTGTGTG 


(SEQ 


ID 


NO: 


767) ; 


TAGTGTGTGC 


(SEQ 


ID 


NO: 


768) ; 


AGTGTGTGCC 


(SEQ 


ID 


NO: 


76 9) ; 


GTGTGTGCCC 


(SEQ 


ID 


NO: 


770) ; 


TGTGTGCCCG 


(SEQ 


ID 


NO: 


771) ; 


GTGTGCCCGT 


(SEQ 


ID 


NO: 


772) ; 


TGTGCCCGTC 


(SEQ 


ID 


NO: 


773) ; 


GTGCCCGTCT 


(SEQ 


ID 


NO: 


774) ; 


TGCCCGTCTG 


(SEQ 


ID 


NO: 


775) ; 


GCCCGTCTGT 


(SEQ 


ID 


NO: 


776) ; 


CCCGTCTGTT 


(SEQ 


ID 


NO: 


777) ; 


CCGTCTGTTG 


(SEQ 


ID 


NO: 


778) ; 


CGTCTGTTGT 


(SEQ 


ID 


NO: 


779) ; 


GTCTGTTGTG 


(SEQ 


ID 


NO: 


780) ; 


TCTGTTGTGT 


(SEQ 


ID 


NO: 


781) ; 


CTGTTGTGTG 


(SEQ 


ID 


NO: 


782) ; 


TGTTGTGTGA 


(SEQ 


ID 


NO: 


783) ; 


GTTGTGTGAC 


(SEQ 


ID 


NO: 


784) ; 


TTGTGTGACT 


(SEQ 


ID 


NO: 


785) ; 


TGTGTGACTC 


(SEQ 


ID 


NO: 


786) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 


787) ; 


TGTGTGACTC 


(SEQ 


ID 


NO: 


788) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 


789) ; 


TGTGACTCTG 


(SEQ 


ID 


NO: 


790) ; 


GTGACTCTGG 


(SEQ 


ID 


NO: 


791) ; 
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TGACTCTGGT (SEQ ID NO; 792} 

ACTCTGGTAA (SEQ ID NO: 794) 

TCTGGTAACT (SEQ ID NO: 796) 

TGGTAACTAG (SEQ ID NO: 798) 



GACTCTGGTA (SEQ ID NO: 793) 

CTCTGGTAAC (SEQ ID NO: 795) 

CTGGTAACTA (SEQ ID NO: 797) 

GGTAACTAGA (SEQ ID NO: 799) 
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TGGAAGGGCTAATTCACTCACGGAAAAGACCAGTTGAACCAG 

AAGAAGATAGAAGAGGCCATGAAGAAGAAAACAACAGATTGT 

TCTGCTTGCTCAGCTGGGGACTTTCCAGAAGGCGCGGCCTGA 

GTGACTAAGCCCCGTTGGGGACTTTCCGAAGAGGCATGAAGG 

GACTTTCCAAGGCAGGCGTGGCCTGGGCGGGACTGGGGAGTG 

GCGAGCCCTCAGATGCTGCATATAAGCAGCTGCTTTCTGCCT 

GTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGC 

TCTCTGGCTAGCTAGGGAACCCACTGCTTAAGCCTCAATAAA 

GCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGT 

GTGACTCTGGTATCTAGAGATCCCTCAGACCATTTTAGTCCG 

TGTGGAAAATCTCTAGCAGTGGCGCCCGAACAGGGACTTGAA 

AGCGAAAGGAAAACCAGAGGAGCTCTCTCGACGCAGGACTCG 

GCTTGCTGAAGCGCGCACGGCAAGAGGCGAGGGGCGGCGACT 

GGTGAGTACGCCGAAAATTTTGACTAGCGGAGGCTAGAAGGA 

GAGAGATGGGTGCGAGAGCGTCAATATTAAGCGGGGGAAAAT 

TAGATAGATGGGAGAAAATTCGGTTAAGGCCAGGAGGAAAGA 

AAAAGTATAAATTAAAACATATAGTATGGGCAAGCAGGGAGC 

TAGAACGATTCGCAGTCAATCCTGGCCTGTTGGAAACATCAG 

AAGGCTGTAGACAAATACTGGGACAGTTACACCCGTCCCTTC 

AGACAGGATCAGAAGAACTTAAATCAGTATATAATGCAGTAG 

CAGTCCTCTATTGTGTGCATCAAAACATAGACATAAAGGACA 

CCAAGGAAGCTTTAGAAAAGATAGAGGAAGAGCAAAACAAAT 

GTAAGAAAAAAGCACAGCAAGCAGCAGCACAGCAAGCAGCAG 

CTGGCACAGGAAACAGCAACCCGGTCAGCCAAAATTACCCTA 

TAGTACAGAACATGCAGGGGCAAATGGTACATCAGGCCATAT 

CACCTAGAACTTTAAATGCATGGGTAAAAGTAATAGAAGAGA 

AGGCTTTCAGCCCAGAGGTAATACCCATGTTTTCAGCATTAT 

C AG AAGGAG C C AC C C C AC AAGATTT AAAC AC C ATGC T AAAC A 

CAGTGGGGGGACATCAAGCAGCTATGCAAATGTTAAAAGAGA 
C CAT C AATG AGGAAG CTG C AGAATGGGATAGATT AC AT C C AG 
CG C AGGCAGGGC CTGTTG CAC CAGGC CAGATGAGAGACC CAA 
GGGGAAGTGACATAGCAGGAACTACTAGTACCCTTCAGGAAC 
AAATAGGATGGATG AC AGGT AAT C C AG CTAT C C CAGT AGGAG 
AAATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAG 
T AAGGATGT ATAG C C CTAT C AGC ATT C TGGAC ATAAAAC AAG 
G AC CAAAGGAAC C CTTTAGAGACTATGTAGACCGGTT CTATA 
AAACTCTAAGAG C CGAGC AAGCTAC ACAGGAGGTAAAAAATT 
GGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATT 
GTAAGACTATTTTAAAAGCATTGGGACCAGCAGCTACACTAG 
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AAGAAATGATGACAGCATGTCAGGGAGTGGGAGGAC C CAGC C 

AT AAAG C AAG AGTT T T GGC AG AAG CAAT GAG C C AAG C AAC AA 

ATGCAGCTACTGTAATGATGCAGAGAAGCAATTTTAGAAACC 

AAAGAAAGAATGTTAAGTGTTTCAATTGTGGCAAAGAAGGGC 

ACATAGCCAGAAATTGCAGGGCTCCTAGGAAAAGGGGCTGTT 

GGAAATGTGGAAAGGAAGGACACCAAATGAAAGATTGTACTG 

AGAGACAGGCTAATTTTTTAGGGAAAATCTGGCCTTCCCACA 

AGGGGAGGCCAGGGAACTTTCTTCAGAGCAGGCCAGAACCAA 

CAGCCCCTCTCCAGGGCAGGCCGGAGCCATCAGCCCCGCCAG 

AAGAGAGCTTCAGGTTTGGGGAGGAGACAACAACTCCCTCTC 

AGAAGCAGGAGCCGATAGACAGGGACAGGGATCTGTATCCTT 

TAGCTTCCCTCAGATCACTCTTTGGCAACGACCCCTCGTCAC 

AATAAAGATAGGGGGGCAGCTGAAGGAAGCTCTATTAGATAC 

AGGAG C AGATGAT AC AGTATTAGAAG ACATGC ATTTG C CAGG 

AAAATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTAT 

CAAAGTAAAACAATATGATGAAATTCTTGTAGAAATCTGTGG 

ACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGT 

CAACATAATTGGAAGAAATCTGTTGACTCAGATTGGTTGCAC 

TTTAAATTTTCCCATTAGTCCTATTGAAACTGTACCAGTACA 

ATTAAAGCCAGGAATGGATGGCCCAAAGGTTAAACAATGGCC 

ATTGACAGAAGAGAAAATAAAAGCATTAGTAGAAATTTGTAC 

AGAAATGGAAAAGGAAGGAAAGATTTCAAAAATTGGGCCTGA 

AAAT C C ATACAAT ACT CC AGTATTTGC C ATAAAGAAAAAAGA 

TGGTACTAAATGGAGAAAATTAGTAGATTTCAGAGACCTTAA 

TAAGAGAACTCAAGACTTCTGGGAAGTTCAATTAGGAATACC 

ACATCCCTCAGGATTAAAAAAGAAAAAATCAGTAACAGTACT 

GGATGTGGGTGATGCATACTTTTCAGTTCCCTTAGATGAAAA 

CTTCAGGAAGTATACTGCATTTACCATACCTAGTATAAATAA 

TGAGACACCAGGGATTAGATATCAGTACAATGTGCTTCCACA 

GGGATGGAAAGGATCACCAGCAATATTCCAAAGTAGCATGAC 

AAGAAT CTTAG AG C CTTTT AGAAGAC AAAAT C C AGAC AT AGT 

TATCTATCAATACATGGATGACTTGTATGTAGGATCTGATTT 
AGAAATAGGACAGCATAGAATAAAAATAGAGGAACTGAGACA 
ACATCTGTTGAAGTGGGGATTTACCACACCAGACAAAAAGCA 
T C AG AAAGAAC C C C C ATT C CTTTGGATGGGTT ATGAACT C C A 
T C C TG AT AAATGGAC AGTG C AACCT AT AGT AC TG C CAGAAAA 
AGACAGCTGGACTGTCAATGACATACAGAAGTTAGTGGGTAA 
ATTAAATTGGG CAAGT C AG ATTTAC C C AGGAATTAAAGT AAG 
GCAATTATGTAAACTCCTTAGGGGAACCAAAGCACTAACAGA 
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AGTAAT AC C ACT AAC AGAAGAAG C AGAGC T AGAAC T GG C AG A 
AAACAGGGAAATT CTAAGAG AAC CAGT ACATGGAGTGTATTA 
T GAC C CAT CAAAAG AC TT AATAG C AGAAATAC AG AAG C AGGA 
GCAAGGCCAATGGACATATCAAATTTATCAAGATCAATTTAA 
AAATCTAAAAACAGGAAAGTATGCAAGATTGAGGGGTGCCCA 
CACTAATGATGTAAAACAATTTCCAGAGGCAGTGCAAAAAAT 
AG C C AC AG AAAG CAT AGTAAT AT GGGGAAAGAC T C CT AAAT T 
TAGACTACC CATACAAAAAGAAACATGGGACGCATGGTGGAC 
AGAGTATTGGCAAGCCACCTGGATTCCTGAGTGGGAGTTTGT 
CAATAC C C CTCC C CTAGTAAAATTATGGTACCAGTT AGAAAA 
AGAACCCATAATAGGAGCAGAAACTTTCTATGTAGATGGGGC 
AGCTAACAGAGAGACTAAATTAGGAAAAGCAGGATATGTTAC 
TGACAGAGGAAGACAAAAAGTTGTCTCCCTAACTGACACAAC 
AAATCAGAAGACTGAGTTACAAGCAATTCATCTAGCTTTGCA 
GGATTCAGGATTAGAAGTAAACATAGTAACAGACTCACAGTA 
TGCATTAGGAAT CATTCAAGCACAAC C AGATAAAAGTGAAT C 
AGAAAT AGT C AAT CAAATAAT AGAGC AATTAATAAAAAAGGA 
AAAGGTCTACCTGGCATGGGTACCAGCACACAAAGGAATTGG 
AGGGAATGAACAAGTAGATAAATTAGTCAGTGCTGGAATCAG 
GAAAATACTATTTTTAGATGGAATAGATAAGGCACAAGAAGG 
C CATGAGAAATATCACAGTAATTGGAGAGCAATGGCTAGTGG 
TTTTAACCTGCCACCTATAGTAGCAAAAGAAATAGTAGCCAG 
CTGTGATAAATGTCAGCTAAAAGGAGAAGCCATGCATGGACA 
AGTAGACTGTAGTCCAGGAATATGGCAACTAGATTGTACACA 
TCTAGAAGGAAAAATTATCCTGGTAGCAGTTCATGTAGCCAG 
TGGAT AT ATAGAAG C AGAAGTT ATT C C AG C AGAG AC AGGG C A 
GGAAACAGCATACTTTATCTT AAAATTAGCAGGAAGGTGG C C 
AGTAAACACAATACATACAGACAATGGCGGCAATTTCATCAG 
TACCACGGTTAAGGCCGCCTGTTGGTGGGCAGGGATCAAGCA 
GGAATTTGGC ATT C C CT AC AAT C C C C AAAGC C AAGG AGTAGT 
GGAAT C T ATGAAT AG AGAATT AAAGAAAATTAT AGGACAGGT 
AAGAGATCAGGCTGAACATCTTAAGACAGCAGTACAAATGGC 
AGTATTCATCCACAATTTTAAAAGAAAAGGGGGGATTGGGGG 
ATACAGTGCAGGGGAAAGAATAGTAGACATAATAGCAACAGA 
CATACAAACTAAAGAATTACAAAAGCAAATTACAAAAATTCA 
AAATTTTCGGGTTTATTACAGGGACAGCAGAGATCCACTTTG 
GAAAGGACCAGCAAAACTTCTCTGGAAAGGCGAAGGGGCAGT 
AGTAATACAAGATAATAGTGACATAAAAGTAGTGCCAAGAAG 
AAAAGTAAAGATCATTAGGGATTATGGAAAACAGATGGCAGG 
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TGATGATTGTGTGGCAAGTAGACAGGATGAGGATTAGAACAT 
GGAACAGTTTAGTGAAACACCATATGTATGTTTCAAAGAAAG 
CTAAGGGATGGATTTATAGACATCACTATGAAAACACTCATC 
CAAAAATAAGCTCAGAAGTACACATCCCACTAGGGGAAGCTA 
GATTGGTAATAACAACATATTGGGGTCTACATACAGGAGAAA 
GAGACTGGCATTTGGGTCAGGGAGTCTCCATAGAATGGAGGG 
AAAGGACATATAGAACACAAGTAGACC C CGAACTAGCAGAC C 
AACTAATTCATATACATTACTTTGATTGTTTTTCAGAATCTG 
C CATAAGAAGTGC CATATTAGGATATAGAGTTAGGCATAGGT 
GTGAATATCAAGCAGGACATAACAAGGTAGGATCTCTACAGT 
AC TTGG CACT AAC AG C ATT AA TAAC AG C AAAG AAGATAAAG C 
CACCTTTGCCTAGTGTTGCGAAACTGACAGAGGATAGATGGA 
A C AAG C C C CAG AAG A C C AAG G G C C AC AG AGG C AG C C AT AC AA 
TGAATGGACACTAGAACTTTTAGAGGAGCTTAAGAATGAAGC 
TGTTAGGCATTTTCCTAGGGTATGGCTCCATGGCTTAGGGCA 
ACATATCTATGAAACTTATGGGGATACTTGGGAAGGAGTGGA 
GGCCATAACAAGAACTCTGCAACAACTGCTGTTTATTCATTT 
CAGAATTGGGTGTCAACATAGCAGAATAGGCATTATTCGACA 
GAGGAGAGCAAGAAATGGAGCCAGTAGATCCTAGACTAGAGC 
CCTGGAAGCATCCAGGAAGTCAGCCTAAGACTGCGTGTACCA 
CTTGCTATTGTAAAAAGTGCTGCTTTCATTGCCAAGTTTGTT 
TTATGACAAAAGGCTTAGGCATCTCCTATGGCAGGAAGAAGC 
GGAGACAGCGACGAAGAGCTCCTCAAGACAGTCAGACTCATC 
AAG CTTATCTAT CAAAGC AGTAAGTAATATATGTAATG C AAC 
CTTTACAAATAGTAGCAATAGTAGCATTAGTAGTAGCAGGAA 
TAATAGCAATAGTTGTGTGGACCATAGTATTCATAGAATATA 
AGAAAATATTAAGACAAAGAAAAATAGACAGGTTGATTGATA 
GAATAAGAGAAAGAGCAGAAGACAGTGGCAATGACAGTGAAG 
GGGATCAGGAAGAATTATCGGCACTTGTGGACATGGGGCACC 
ATGATCCTTGGGATATTAATGATCTGTAGAGCTGCAAACAAT 
TTGTGGGTCACAGTCTATTATGGGGTACCTGTGTGGAGAGAA 
GCAACCACCACTCTATTTTGTGCATCAGATGCCAAGGCATAT 
GATGCAGAGGTACATAATGTTTGGGCCACACATGCCTGTGTA 
CCCACAGACCCTAACCCACAAGAAGTAGAATTGAAAAATGTG 
ACAGAAAATTTTAACATGTGGAAAAATAACATGGTAGAACAG 
ATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTGAAG 
CCATGTGTAAAATTAACCCCACTCTGTGTTTCTTTAAATTGC 
AC TGATG CT ACT AAT AC C AC T AATAGT AAT AC CACT AG CAG C 
AGCGAGAAACCGAAGGGGACAGGGGAAATAAAAAACTGCTCT 
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TTCAATATCACCACAAGCATAAGAGATAAGGTGCAGAAACAA 
TATGCACTTTTTTATAGCCTTGATGTAGTACCAATGGATGAT 
AATGATAATAGTAC AAG CTATAGGTTAATAAGTTGTAACAC C 
TCAATCATTACACAGGCCTGTCCAAAGATATCCTTTGAGCCA 
ATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTA 
AAGTGTAAAGATAAAAGGTTCAATGGAAAAGGACCATGTACA 
AGTGTCAGCACAGTACAGTGTACACATGGAATTAGGCCAGTA 
GTATCAACTCAACTGTTGTTAAATGGCAGTCTAGCAGAAGAA 
GAGGTAGTAATTAGATCTGACAATTTTACGAACAATGCTAAA 
AC C AT AATAGT AC AG C TGAGCAAAT C TGTAGAAATTACTTGT 
GTAAGAC C CAACAACAATACAAGAAAAAGTATAAGTATGGGA 
CCAGGGAGAGCATTTTATACAACAGAAATAATAGGAGATATA 
AGACAAGCATATTGTAACATTAGTAAAGCAAACTGGACTGAC 
ACTTTAGAACAGATAGCTAGAAAATTAAGAGAACAATTTGAG 
AAT AAAACAATAGT C TTTAAGC CAT C C T CAGGAGGGGAC CC A 
GAAATTGTAACACAGTTTTACAGTTTTAATTGTGGAGGGGAA 
TTTTTCTACTGTAATTCAACACAACTGTTTAATGGTACTTGG 
AATGGTACTTGGGTTAATGGTACTTGGAGTAGTAATAATACG 
ACTGATACTGCAAATATCACACTCCCATGCAGAATAAAACAA 
TTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCC 
C CT C C CAT CAAAGGAC AAATTAAATGT ACAT C AAAT ATT AC A 
GGGCTGATATTAACAAGAGATGGTGGTAACAATAACACCACG 
AACGACAACGAGACCGAGACCTTCAGACCTGGAGGAGGAGAT 
ATGAGGGACAATTGGAGAAGTGAATTATATAAATATAAAGTA 
GTACAAGTTGAACCATTAGGAGTAGCACCCACCAAGGCAAAG 
AGAAGAGTGGTGCAAAGAGAAAAAAGAGCAGTGGGAATAGGA 
GCTATGTTCCTTGGGTTCTTAGGAGCAGCAGGAAGCACTATG 
GGCGCAGCGTCAGTGACGCTGACGGTACAAGCCAGACAATTA 
TTGTCTGGTATAGTGCAGCAGCAGAACAATCTGCTGAGGGCT 
ATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGC 
ATCAAACAGCTCCAGGCAAGAGTCCTGGCTGTGGAAAGATAC 
CTAAGGGATCAACAGCTCCTGGGACTTTGGGGTTGCTCTGGA 
AAACTCATTTGCACCACTACTGTGCCTTGGAACAATAGCTGG 
AGTAATAAATCTCTGGAAACAATTTGGGATAACATGACCTGG 
ATGCAGTGGGAAAGAGAAATTGACAATTACACAAACATAATA 
T AC AC C TT AATTGAAGAATCGC AGAAC CAAC AAGAAAAAAAT 
GAACTAGAATTATTGGAATTGGATAAATGGGCAAATTTGTGG 
AATTGGTTTAGTATATCAAACTGGCTATGGTATATAAAATTA 
TTCATAATGGTAGTAGGAGGCTTGGTAGGTTTAAGAATAGTT 
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TTTACTGTACTTTCTATAGTTAATAGAGTTAGGCAGGGATAC 

T CAC CATTAT CGTTT C AGAC C C AC CT C C CAAC C C CGAAGGGA 

C CCGACAGGCCAGAAGGAAT CGAAGAAGAAGGTGGAGAGAGA 

GACAGAGGCAGCTCCACTCGATTAGTGCACGGATTCTTAGCA 

CTTTTCTGGGACGACCTGAGGAGTCTGTGCCTCTTCAGCTAC 

CACCACTTGAGAGACTTACTCTTGATTGTAACGAGGATTGTG 

GAACTTCTGGGACGCAGGGGATGGGAAGCCCTCAAATACTGG 

TGGAATCTCCTGCAGTATTGGAGGCAGGAACTACAGAAGAGT 

GCTGTTAGCTTGTTCAATGGCACGGCCATAGCAGTAGCTGAG 

GGGACAGATAGAGTTATAGAAGCTTTACGAAGGGCTTATAGA 

GCTATTCTCCACATACCTAGAAGAATAAGACAGGGCTTAGAA 

AGGGCTTTGCTATAAAATGGGTGGCAAGTGGTCAGAAAGTAG 

TGTGGTTAGAAGGCATGTACCTTTAAGACAAGGCAGCTATAG 

ATCTTAGCCGCTTTTTAAAAGAAAAGGGGGGACTGGAAGGGC 

TAATTCACTCACGGAAAAGACCAGTTGAACCAGAAGAAGATA 

G AAGAGG C C ATGAAGAAGAAAAC AAC AGATTGTT CTG CTTG C 

TCAGCTGGGGACTTTCCAGAAGGCGCGGCCTGAGTGACTAAG 

CCCCGTTGGGGACTTTCCGAAGAGGCATGAAGGGACTTTCCA 

AGGCAGGCGTGGCCTGGGCGGGACTGGGGAGTGGCGAGCCCT 

CAGATGCTGCATATAAGCAGCTGCTTTCTGCCTGTACTGGGT 

CTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGCT 

AGCTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTT 

GAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTG 

GTATCTAGAGATCCCTCAGACCATTTTAGTCCGTGTGGAAAA 
TCTCTAGCA 
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